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MEASUREMENTS ON THE LIVING. 


BY & ¢ MAHALANOBIS, M.A. (Cantab.), Professor, 
Presidency College, Calcutta. 
I. INTRODUCTION. 

1, Lack of agreement between different observers about fundamental definitions 
and technique of measurement constitutes an almost insurmountable obstacle to 
comparative studies in Anthropometry of the living. In an attempt to discover 
how far the measurements provided by different workers can be used for purposes 
of inter-racial comparisons I selected the following material : 

(i) Y. Koganei (1893). “Beitriige zur physischen Anthropologie der Aino.” 

(ii) AleS Hrdlitka (1912). “The Natives of Kharga Oasis, Egypt.” 

(i) F. von Luschan (1913). “Beitriige zur Anthropologie von Kreta.” 

(iv) T. Kubo (1913). “Beitriige zur phys. Anthropologie der Koreaner.” 

(v) Fritz Sarasin (1916—-1922). “Anthropologie der Neu-Caledonier und 
Loyalty-Insulaner.” 

(vi) L. H. Dudley-Buxton (1922). “Ethnology of Malta and Gozo.” 

(vii) S. M. Shirokogoroff (1923). “Anthropology of Northern Asia.” 

(viii) S. M. Shirokogoroff (1925). “Anthropology of Eastern China, ete.” 

(ix) AleS Hrdli¢ka (1925). “Old Americans.” 

(x) H. Lundborg and F. J. Linders (1926). “Anthropologica Suecica.” 

2. All the above-mentioned authors are well-known workers in anthropology. 
Koganei was the Professor of Anatomy in the Imperial University of Tokio, and 
Kubo the Professor of Anatomy in Taihan Hospital, Seoul, Korea, at the time they 
took their measurements on the Aino (8)* and Koreans (10) respectively. Hrdli¢ka 
was the Curator (Division of Physical Anthropology) at the United States National 
Museum, when he collected the materials for his monograph on the Egyptians of 
Kharga Oasis (4). F. von Luschan and Fritz Sarasin were trained as anthropologists 


and have published numerous important papers and books on anthropology. 
H. Lundborg and F. J. Linders are the Director and Vice-Director respectively of 


* A list of bibliographical references is given in Appendix II], Numbers within brackets refer to 
that list. 
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the Swedish State Institute for Race Biology ix Upsala. Dudley-Buxton was a 
member of the staff of the Department of Human Anatomy in the University of 
Oxford when he collected the material for his paper on the Maltese (3), and is 
at present, I believe, the Head of the Department of Anthropology in the same 
University. Shirokogoroff has published several papers on the comparative anthro- 
pology of the Far East, and appears to be an acknowledged authority in this 
subject. 


3. One would naturally expect that the works of such trained scientists would 
be free from ambiguities, and would be comparable with one another, especially 
when almost all the investigations (with one exception, that of Koganei) were 
conducted several years later than the date of the International Agreement of 
Monaco on Anthropometric Measurements in 1906 (20). To my great surprise 
and consternation I found that this was far from the actual fact. Fresh discrepancies 
in definitions or in technique continually cropped up, and I was forced to make 
a systematic comparison of the different definitions of the measurements on the 
flesh. In the present paper I have given the results of such a comparison, restricting 
myself to measurements on the living head, and to the following standard lists 
cited in one or other of the publications mentioned above. 


(i) “Notes and Queries on Anthropology” (1st edition, 1874). 


(ii) Paul Broca (1879). “Instructions générales pour les recherches anthro- 
pologiques & faire sur les vivants.” 


(iii) Paul Topimard (1885). “Eléments d’Anthropologie générales.” 
(iv) Paul Topinard (English translation, 1890). “Anthropology.” 


(v) R. Virchow (1885). List on pp. 99—1L02 of “Zeitschrift fiir Ethnologie, 
Verhandlungen,” Bd. Xvil. 


(vi) KE. Schmidt (1888). “Anthropologische Methoden.” 

(vil) “Notes and Queries” (2nd edition, 1892), 

(viii) “Schedule of Measurements in Anthropology” (British Association, 1895). 
(ix) “Notes and Queries” (3rd edition, 1899). 


(x) F. von Luschan (1906). Article on physical anthropometry in Neumayer’s 
“Anleitung zu wissen. Beobachtungen auf Reisen,” Bd. 11. 

(xi) Report of the Anthropometric Commission appointed by the XIIIth 
International Congress of Prehistoric Anthropology and Archaeology at Monaco, 
1906. 


(xi) Report of the Anthropometric Committee of the British Association 
(1908). 


(xiii) “Notes and Queries” (4th edition, 1912). 


(xiv) Rudolf Martin (1914). “Lehrbuch der Anthropologie.” 
(xv) AleS Hrdliéka (1920). “Anthropometry.” 
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Koganei gives very short definitions, which are usually insufficient, and refers 
to Virchow’s paper (31). Unfortunately this paper gives merely a list of measurements 
without any definitions and descriptive notes. Koganei however also mentions 
Schmidt's book (25), and it is probable that he actually followed it as his standard. 

Kubo, in his work on the Koreans (10), constantly refers to Schmidt (25) and 
occasionally to von Luschan (11). Kubo was a pupil of Koganei’s (Kubo mentions 
this fact in dedicating his book to Koganei), it therefore appears probable that he 
had been introduced to Schmidt’s list by Koganei himself. 


Hrdli¢ka in his work on the Egyptians (4) does not give any systematic 
definitions, neither does he refer to any standard list. The system of measurements 
described in his book on Anthropometry (5) was published several years later, in 
1920, but in this latter book he mentions that “the procedures, instruments, etc., to 
be here described, are those in regular use at the Division of Physical Anthropology, 
U.S.A. National Museum and in the field-work for the same” ((5), p. 33). As 
Hrdli¢ka had gone out to Egypt while holding the post of Curator of the same 
department it may be presumed that his measurements were taken in accordance 
with the specifications described in his book on Anthropometry (5). The work on 


“Old Americans” (6), published in 1925, may also be presumed to be based on the 
same standards. 


F. von Luschan, in his paper on the Cretans, distinctly mentions that “his 
technique was the same as that which he had originally learnt from Broca and 
which he himself, with slight modifications, has taught for three decades in practical 
courses” (translated from S. 354, Zeitschrift fiir Ethnologie, 1913). Broca’s instruc- 
tions (1) and von Luschan’s own instructions in (11) have been consulted for 
elucidating his definitions. 


Sarasin does not give any systematic definitions, but in view of his numerous 
references to Martin (14), I have assumed that Sarasin has followed Martin’s scheme, 
except in points where Sarasin definitely states otherwise. 

Dudley-Buxton mentioned in his paper ((3), p. 175) that his “measurements were 
taken in accordance with Martin’s definitions,” referring no doubt to (14). As I felt 
doubtful about certain points, Dr G. M. Morant of the Biometric Laboratory, London, 
at my request very kindly wrote to Mr Buxton and received the following reply in 
a letter dated Oxford, 5th February, 1927: 

“You will find most conveniently my technique summarized in the British 
Association Anthropometric Investigations in the British Isles, reprinted from the 
B.A. Report 1908. My measurements are, following their numbering : 

A tO. 2G. 4. M.F.D. 

B.4. OFE. 3. Tea. 3. Biz.B. 
iG Bo Ne, 4, N.B. 

EK. 1. Stature.” 


or 


> 


.E.O.B, 8. Big.B. 


Shirokogoroff mentions ((27), p. 1) “the list of measurements elaborated by the 


1—2 
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International Commission in Geneva in 1912,” and refers to the Revue anthropolo- 
gique, Nos. 7—8, Juillet—Aoitt, 1913. The Geneva Commission (7) does not however 
give any measurements on the head as its work was restricted to measurements on 
the body. But as the Geneva Commission was a continuation of the earlier Monaco 
Commission of 1906 (20) I have assumed that Shirokogoroff really intended to 
refer to this latter list. According to Shirokogoroff this list “is generally used in 
Russia,” and “the list elaborated by the Commission in England does not differ from 
this except in the measurements of the physiognomical length of the head.” From 
the fact that names used by Shirokogoroff are usually those of the B.A. Report (21), 
and that the physiognomical face length does not occur in the latter, I have identified 
his “list elaborated by the Commission in England” with the B.A. Report (21) and 
have used it in conjunction with the Monaco standard (20) in connection with 
Shirokogoroff’s measurements. 

Systematic definitions of measurements on the head are not given in Anthro- 
pologica Suecica (13) but notes are given here and there from which it is possible 


in many cases to reconstruct the definitions fairly satisfactorily. 


II. DEFINITIONS OF MEASUREMENTS OF THE HEAD (FLESH). 
(1) Head Length. 


5. Buxton and Shirokogoroff follow the B.A. Report (21): “No. A 1 Maximum 
Length. From the most prominent point of the glabella to the most distant point 
on the back of the head, known as the occipital point.” Notes and Queries (1874, 
1892, 1899, and 1912) and the British Association Schedule (1895) all give the 
same definition which is in conformity with the corresponding measurement on the 
skull. Attention is drawn to the necessity of taking the measurements in the median 
vertical plane. It corresponds to the International Commission No. 1 “ Longueur 
maxima de la téte ou diamétre antéro-postérieur maximum. Méme technique que 
pour le crane ; ne pas presser” ((20), p. 569). The technique on the skull is described 
as follows: “C’est le plus grand diamétre dans le plan sagittal et médian du crane.” 

“Points anatomiques: en avant: le point le plus saillant de la protubérance 
intersourciliere (glabelle de Broca); en arriére: le point le plus saillant du sus- 
occipital donné par le maximum d’écartement des branches du compas” ((20), p. 563). 

Sarasin presumably follows Martin’s definition ((14), p. 157): “No. 1 Grisste 
Kopflainge (diamétre antéro-postérieur maximum ou glabellaire ; maximum glabello- 
occipital length): Geradlinige Entfernung der Glabella von Opisthokranion, d. h. 
von dem am meisten hervorragenden Punkte des Hinterhauptes in der Median- 
sagittalebene.” 


Koganei also states definitely ((8), p. 254) that it is the greatest glabello- 
occipital distance, “die grésste Entfernung zwischen dem Stirnnasenwulst und 
Hinterhaupt.” 


Hrdlicka detines it as the “ maximum glabello-occipital diameter of the vault” 


5 


((5), p. 68). The procedure described by him is a little different from the B.A. Report 
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and Martin, but it does not appear likely to make any appreciable difference in the 
measurements. 

Von Luschan follows Broca’s instructions ((1), pp. 105—106): “C'est le plus 
grand écartement qu’on puisse donner au compas sur la ligne médiane du crane. 
L’une des branches est appliquée sans pression au-dessus de la racine du nez, sur 
le point culminant de la glabella ou bosse nasale; on |’y fixe avec la main gauche 
pendant que la main droite proméne l'autre extrémité du compas sur le derriére 
de la téte....” 


Kubo states that his measurement is the one called “ Langsdurchmesser des 
Schadels” by Schmidt ((25), p. 105): “Der vordere Messpunkt liegt in der Mittellinie 
der Stirn, dicht iiber dem Glabellawulst, wo dieser wenig oder gar nicht entwickelt 
ist, einen kleinen Querfingerbreit iiber dem Niveau der Augenbrauenbogen.” This 
of course is quite different from the glabello-occipital length, and is really the 
distance from a point ranging from Glabella to Ophryon and the occipital point. 
It is, however, very curious that Kubo appears to think that the Glabella and the 
Ophryon are one and the same point. He says in one place ((10), p. 179), “ich 
habe...die Entfernung zwischen Ophryon (Glabella) der Stirn und dem diussersten 
Punkte des Hinterhauptes gemessen.” Apart from the use of the Glabella within 
brackets, evidently as a synonym for Ophryon, he goes on to explain in the very next 
sentence: “Meine Kopflinge ist nicht die Projektions-, sondern die direkte 
Kopflinge von der Glabella (italics our own) bis zum hervorragendsten Punkte des 
Hinterhauptes, der durchaus nicht immer die Protuberanz ist” ((10), pp. 179—180). 
It scarcely needs pointing out how careful one must be in making comparisons. 


6. One point requires notice. The B.A. Report explicitly states ((21), p. 8) that 
“the pressure of the points of the callipers on the head should be as much as can 
be comfortably borne by the person under examination *.” Schmidt too recommends 
the application of fairly strong pressure: “Die beiden Stangenzirkelarme sind 
ziemlich stark an das Haupt anzupressen ” ((25), p. 105). 

On the other hand Broca (1) lays down that the measurement should be taken 
without pressure (“sans pression”), and von Luschan following Broca definitely 
states that his measurements were taken without pressure : “ich bei der Bestimmung 
von Linge und Breite des Kopfes gar nicht driicke” ((12), p. 334). Martin is also 
definitely of the same opinion: “ Werden die Spitzen des Instrumentes unter so 
starkem Druck, als es das Individuum aushalten kann (Vorschrift der British 
Association), an die Kopfhaut angepresst, dann wird das Mass zu klein. Eine 
derartige Messung ist aber nicht nur schmerzhaft, sondern auch ungenau und daher 
zu verwerfen” ((14), p. 157). He adds in a footnote on the same page: “ Ein 
Vergleich der Kopfmasse mit den Schiidelmassen ist nattirlich nur dann méglich, 
wenn nach absolut gleicher Technik gemessen und das Instrument nicht in die 
Kopfhaut eingepresst wird.” 

* Notes and Queries (4th edition, 1912) practically follows the B.A. Report and says ‘‘ the pressure 


between the points of the callipers should be firm, but not uncomfortable” (p. 6). Earlier British 
authorities are silent on this point. 
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The Monaco Commission distinctly says: “ne pas presser” ((20), p. 57). Hrdlitka 
translates it as “do not press” ((5), p. 21), but curiously enough adds the following 
note, “a moderate amount of pressure is of course necessary ; the instruction is 
directed against hard pressure.” I do not understand the force of “of course,” nor 
what authority Hrdli¢ka has for interpreting the International Instructions in the 
way he does. It is however clear that Hrdli¢ka’s measurements were taken with a 
moderate degree of pressure. 

Shirokogoroff does not appear to be aware of the direct opposition between the 
two authorities cited by him: the B.A. Report (21) and the Monaco Agreement (20), 
and it is uncertain what procedure he followed. The fact that he relegates the B.A. 
Report to a footnote, while he refers to the International Commission in his text, 
may be interpreted as implying that he attached greater importance to the latter. 
If this interpretation be correct then his measurements would be “ without pressure.” 
This is supported by the fact that he gives the Korean Head Length and the Head 
Breadth as 183°9 mm. and 153°7 mm. respectively against 181°4 mm, and 150°9 mm. 
given by Kubo. This amounts to an excess of 2°5 mm. and 2°8 mm. respectively 
for measurements taken without pressure *. 


7. The Maltese (Buxton), the Korean (Kubo), the Egyptian and the Old American 
(Hrdliéka) and very likely the Aino (Koganei) measurements were taken with the 
application of a certain amount of pressure and are not strictly speaking com- 
parable with the measurements on the Cretans (von Luschan), the New Caledonians 
(Sarasin) and the Mongolian people of the far east (Shirokogoroff) which were 
taken without pressure. Definite information is not available about the Swedish 
data, but assuming that no pressure was employed they would belong to the second 
group. 

Thus we find that Maltese, Aino, Egyptians and Old Americans are comparable 
with one another, as also Cretans, New Caledonians and the Mongolian and possibly 
the Swedish samples, but members of one group are not comparable with those 
belonging to the other group. The Korean Head Length stands apart and is not 
comparable with any other. 

(2) Head Breadth. 

8. B.A, Report ((21), p. 8): “ Maximum Breadth. Measured wherever it can be 
found above the plane of the ear-holes. The callipers should be held in a vertical 
transverse plane and moved about until the maximum diameter is ascertained, the 
observer being careful to keep the points of the callipers exactly opposite to one 
another.” B.A. Schedule ((24), p. 3) states that the maximum breadth is to be found 
“usually about the top of the ears,” but does not otherwise restrict the location. 
Notes and Queries ((17), 2nd edition, 1892) states that it is to be taken “ wherever 
it may be except low down behind the ears.” 

Martin ((14), No. 3, p. 159): “Grisste Kopfbreite (diamétre transversal maxi- 
mum). Geradlinige Entfernung der beiden Eurya voneinander, d. h. grésste Breite 


* 


With a mean standard deviation of about 7-0 mm. a difference of 3 mm. in one of the Head 
Lengths (due to difference of technique) will make a difference in (M 
for samples of 100, and will be definitely appreciable. 


M’)? of nearly 9 times the variance 











P. C. MAHALANOBIS 7 


senkrecht zur Median-sagittal-Ebene, wo sie sich findet....Die Messpunkte miissen 
in einer Horizontal- und Frontal-Ebene liegen.” 

Broca ((1), p- 166) mentions “Les extrémités de ce diamétre sont en général 
situées & deux ou trois travers de doigt en arriére et au-dessus du bord supérieur 
de Voreille ; mais cela est loin d’étre constant, et il faut toujours tatonner un peu 
pour trouver le véritable maximum du diamétre transversal.” Topinard ((30), p. 36) 
too mentions “above the ears,” but does not otherwise restrict the location. 

The International Commission No. 3: “ Largeur maxima de la téte ou diamétre 
transverse maximum ” ((20), p. 569). The technique is the same as on the crania: 
“Crest le plus grand diamétre horizontal et transversal qu’on puisse trouver avec le 
compas d’épaisseur sur la boite cranienne. Point anatomique, déterminé seulement 
par le maximum, mais si ce dernier tombait sur les crétes sous-temporales, il faudrait 
éviter leur saillie, en placant le compas au-dessus ” ((20), p. 563). 

Schmidt ((25), No. 31, “Breite des Schiidels,” p. 105) lays emphasis on the need 
for both arms of the instrument lying in the same horizontal plane, but does not 
otherwise discuss the question of the position where the measurement is to be taken. 


Hrdli¢ka ((5), p. 96) describes it as the “maximum breadth of the skull above 
the supramastoid and zygomatic crests.” 

As regards the location of the end-points there is substantial agreement, 
although all the definitions are not rigidly identical. In particular cases, especially 
for deformed individuals, differences may arise owing to the slight ambiguity in 
the description of the region where this measurement is to be taken. 


9. As regards pressure, B.A. Report ((21), p. 8) makes the same remarks as in 
the case of Head Length: “the pressure on the points of the head should be as 
much as can be comfortably borne by the person under examination.” Other 
authorities do not mention anything explicitly on this point; but it may be 
presumed that they would adopt respectively the same procedure as in the case of 
Head Length. On this assumption, Maltese, Korean, Egyptian, Old American, 
and very likely Aino data would form one group, and Cretan, New Caledonian, and 
Mongolian a second one, comparison being possible within either group. It is 
probable that the Swedish measurements also belong to the second group*. 


(3) Cephalic Indea. 

10. Maltese, Aino, Egyptians, and Old Americans may be compared with one 
another, so also Cretans, New Caledonians and Mongolians and possibly the Swedish. 
The Korean index is not comparable with any other, even apart from considerations 
of “pressure”. 


* With a mean §S.D. of 5:72 mm. approximately, a difference of 3 mm. in Head-Breadth (which may 
easily arise owing to the application of pressyre) will make a difference of about 14 times the variance 
in (M—M"')? in samples of 100. 

+ If the same observer uses the same amount of pressure in taking both the length and breadth 
measurements of the head, then the error in the cephalic index will be partly compensated, and will not 
generally be appreciable in samples of small or moderate size, but may become significant for large 
samples. 
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(4) Height of the Head. 

11. Koganei defines this measurement ((8), p. 254) as “die senkrechte Héhe vom 
oberen Rande der aéusseren Gehéréffnung bis zum Scheitel.” Kubo gives an identical 
definition ((10), p. 223), and his measurements are therefore comparable with those 
of Koganei’s. This is also* the definition accepted by the Monaco Commission, No. 3: 
“Hauteur de la téte (placée bien d’aplomb sur ses condyles)....Points anatomiques : 
en haut: vertex; en bas: bord supérieur du trou auditif, dont le point de repere 
(toujours 4 vérifier) est ordinairement le fond de l’échancrure comprise entre le 
tragus et l’hélix” ((20), p. 570). 

Shirokogoroff adopts a different procedure. He explains that he measured the 
total stature as well as the height of the ear-hole from the ground. “It was thus pos- 
sible to work out the height of the head by subtraction of the height of the ear-hole 
from the stature” ((27), p. 2). It is not clear which portion of the ear-hole (the upper 
border, or the centre, or the lower border) he took as his end-point. The usual 
definition of “height to ear-hole” extends to the middle of the ear-hole (e.g. Notes 
and (Queries, 4th edition, 1912, p. 4). Assuming that Shirokogoroff also measured 
to the middle of the ear-hole, his measurements of the Height of the Head would 
not be comparable with those of Kubo and Koganei. 

Hrdlicka in his monograph on the Egyptians ((4), p. 48) mentioned that he took 
the measurement “by a spreading and sliding compass. The branches...are in- 
troduced well into the auditory meati and allowed to rest on the floor. The expansion 
of the instrument is noted, with the scale held over the bregma region; the distance 
from the bregma region to the lower edge of the scale is measured by the rod of 
the ‘compas glissiére,’ and a simple arithmetical process gives the biauricular line- 
bregma height.” He explains that this gives the distance between “the line 
connecting the floor of the auditory canals to the scalp over the head.” 

In his Anthropometry ((5), p. 21), in a footnote to his translation of the definition 
given by the Monaco Commission, Hrdli¢ka says “the height from the middle of 
the line connecting the floor of the external auditory canals to bregma is now more 
in vogue,” and on p. 70 states that this method “has been practised by the author 
since 1898.” His measurements are therefore not comparable with those of either 
Koganei and Kubo, or Shirokogoroff. 

Hrdlicka mentions that his method “gives results somewhat higher than those 
obtained by Gray’s radiometer” ((4), p. 48). In fact for the same individual, 
Hrdli¢ka’s method would give the greatest, Kubo and Koganei’s the lowest readings, 
while Shirokogoroff’s value would lie between the two. 

To sum up, Kubo’s and Koganei’s measurements are comparable with each other 
but with no other. Nor are the other measurements comparable among themselves +. 

* [I am not clear that this is so, the ‘‘vertex” isthe highest point of the head above the Frankfurt 
plane, it is not necessarily identical with the “apex,” the point in the median plane vertically above the 
biauricular line. From biauricular line to vertex will usually be greater than the vertical height of the 
head. K.P.] 

{ The difference between Kvbo and Shirokogoroff for the Korean sample is 7-2 mm. With a mean 
8.D. of 613 mm. this will make a difference of more than 125 times the variance in (M-—M’)?*, 
Hrdlicka mentions in one place ((5), p. 72) that his method *‘ with due care...gives results which may 
vary within less than 3 min.”’ Assuming that a difference of even 3 mm. is made by adopting different 
procedures, a difference of nore than 12 times the variance will be made in (M/ 


M’)? for samples of 100. 





P. C. MAHALANOBIS 9 


(5) Horizontal Circumference of the Head. 


12. Koganei describes it ((8), p. 255) as the circumference measured over the 
Glabella and the furthest part on the back of the head (“iiber der Glabella und dem 
vorspringendsten Punkte des Hinterhauptes gemessen”). This corresponds to No. 45, 
Horizontalumfang des Kopfes, of Martin ((14), p. 173) which passes over the Glabella 
and the “vorspringendsten Punkte des Hinterkopfes (Opist hokranion),” and to the 
B.A. Report No. A 4 Maximum Circumference ((21), p. 8) which is described as the 
circumference “measured by passing the tape horizontally (sic) round the cranium 
at the level of the glabella in front and the occipital point behind.” 

Kubo adopts the same definition as Koganei, and says ((10), p. 212): “Ich habe 
den Umfang desselben iiber der Glabella und dem vorstehenden Punkte des Hinter- 
hauptes so gemessen, dass das Bandmass genau in einer transversalen Ebene, nicht 
auf der einen Seite héher als auf der anderen zu liegen kam.” But we have already 
seen that he believes the Glabella and the Ophryon to be identical points. This 
confusion makes his present definition a little uncertain. If he has actually measured 
it over the Ophryon then his circumference would correspond to Martin’s ((14), 


p. 173) “No. 45 a@ Horizontalumfang rund um den Kopf, jedoch iiber das Ophryon 
(statt tiber die Glabella).” 


13. The qualifying phrase “horizontal” appears to be a little misleading. Broca 
says ((1), p. 277): “Elle n’est pas horizontale, comme son nom pourrait le faire croire, 
mais toujours plus ou moins oblique, attendu qu’elle passe plus haut en avant qu’en 
arriére....On n’oubliera pas que cette courbe est un maximum.” 


(6) Nasal Height. 


14. The Nasal Height (which is a constituens of the Nasal Index) is recognised to 
be one of the most important measurements in cephalometry,and yet the most diverse 
methods appear to have been used in measuring it. Martin ((14), pp. 166—167) 
gives us no less than five different definitions under the general name “nasal height” 
or “nasal length.” The upper point of reference has been taken as (i) the Ophryon, 
Gi) the intersection (in the median-plane) of the tangent line to the two orbital 
edges, (iii) the nasion (or what is believed to be the point on the flesh corresponding 
to the naso-frontal suture on the skull), or (iv) the deepest part of the nasal bridge. 
There is greater agreement about the lower point of reference; it has usually been 
taken as the sub-nasal point (where the nasal septum joins the upper lip). But in 
at least one case (Shirokogoroff) this latter point seems to have been confused 
with the pro-nasal point or tip of the nose. The surprising thing in this connection 
is the lack of appreciation on the part of eminent anthropometrists of the need for 
noting a precise description of the definition actually adopted in practice. 

B.A. Report ((21), p. 9) gives: 


“No. C1 Nasal Height. From the nasion to the 
sub-nasal point.” 


Martin ((14), p. 166): “No. 21 Hihe der Nase (fiilschlich Linge der Nase, Naso- 


basalliinge, hauteur ou longueur du nez). Geradlinige Entfernung des Nasion vom 
Sub-nasale. 
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Broca ((1), p. 182): “No. 4 Longueur du nez. Elle se mesure en appliquant 
transversalement les deux branches de la glissiére sur la racine du nez, et sur le 
point sous-nasal.” 

Topinard ((30), p. 357): “The height is taken with the sliding compass, vertically 
from the root to the base of the nose, as on the skull.” 

The Monaco Commission ((20), p. 570) gives No, 12 “Hauteur du nez. Points 
anatomiques: en haut: nasion; en bas: sous-cloison du nez, au niveau de son union 
avec la lévre supérieure. Ne pas presser.” 

Hrdlicka himself ((5), p. 74) has a practically identical definition: “Nose: 
Length.—The length (or ‘height’) of the nose from the nasal septum where this 
joins the upper lip, to the nasion.” 

Koganei ((8), p. 255) calls it “Nasenliinge” and defines it as “die Entfernung der 
Nasenwurzel vom Ansatze der Nasenschiedewand an der Oberlippe.” Kubo calls it 
((10), p. 317, No. 34) “Hihe (Liinge) der Nase,” gives an identical definition and 
adds “die Nasenliinge ist also nicht auf dem Riicken, sondern an der Basis der 
Nase, zwischen Nasenwurzel und Nasen-Oberlippen- Winkel, zu messen,” which is 
practically a quotation from Schmidt ((25), p. 108, No. 49). This makes it clear that 
the sub-nasal point and not the tip of the nose is the lower point of reference. 

15. It will be noticed that in the above definitions the upper point of reference 
is given as the “nasion,” the “Nasenwurzel” or the “root of the nose,” and in spite 
of the verbal agreement this is where trouble begins, because there exists a complete 
lack of agreement about the precise definition of these terms. 

B.A. Report ((21), p. 6) defines the “Nasion” as “the bottom or deepest part of 
the depression between the forehead (Glabella) and the nose, or, in other words the 
most depressed part at the root of the nose*.” Other British authorities are in 
agreement on this point. Motes and Queries ((19), 1912, p. 7, No. 7) gives “Nasal 
Height—from the nasion or the most depressed point at the root of the nose to the 
sub-nasal point or angle between the septum of the nose (i.e. the partition between 
the nostrils) and the upper lip.” Notes and Queries ((18), 1899, p. 22) gives the 
following description: “...the upper point at the termination or root of the nose 
between the eyes....This last point is sometimes a little difficult to determine. 
There is a small transverse fold of the skin (sometimes two folds) at the root of the 
nose; it is on this fold, or when there are two folds, between the folds, that the upper 
point of the instrument should rest, yenerally about two millimetres above the level 
of the transverse axis of the eye.” This description is a verbatim copy from the 
second edition (1892) of the same book. The Brit. Assoc. Schedule ((24), 1895) 
describes the nasal height as the distance “from the furrow at root of nose to the 
angle between the nose and the upper lip in the middle line.” 

The British view therefore identifies, and has always identified, the “Nasion” 
with the root of the nose, defining the latter term as the most depressed part of the 
nose where the skin has frequently a fold or two. 


* It is curious however that the illustration on p. 387 (original report) shows the nasion on the flesh 
as coinciding with the naso-frontal suture on the skull. This is one example of the lack of preciseness 
in essential points found in existing standards. 
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16. Other authorities are however of quite different opinions. Martin is quite 
explicit on this point ((14), p. 129): “Die Nasenwurzel entspricht nicht der am 
tiefsten eingesattelten Stelle des Nasenriickens, die meist im oberen Teil der Ossa 
nasalia gelegen ist, sondern der Sutura naso-frontalis, deren Verlauf nach einiger 
Uebung trotz des vorhandenen Nahtgewebes und des meist diinnen M. procerus 
(M. depressor glabellae n. H. Virchow) auch am Lebenden festgestellt werden kann. 
Man findet den Punkt am besten, wenn man seine rechte Hand ruhig auf den Kopf 
des zu messenden Individuum legt und mit dem lateralen Rand der Daumenbeere 
unter leichtem Druck die Haut auf der Nasenwurzel auf und ab schiebt. Man 
beachte, dass das Nasion in der Regel im Niveau der medialen Enden der hiirenen 
Augenbrauen, meist an deren Unterrand, nicht in der Héhe der Lidspalte gelegen 
ist.” He concludes by observing: “Von manchen Autoren wird fiilschlich die am 
tiefsten eingesattelte Stelle der Nase als ‘Nasenwurzel’ bezeichnet.” 

The illustration in his book ((14), fig. 38, p. 128) clearly shows that according to 
Martin the nasion is situated at a considerable distance above the most depressed 
part of the nasal bridge. In fact Martin had stated at a different place ((15), 
pp. 393—394): “Allerdings liegt die tiefste Kinsattelung der Nase gewéhnlich nicht 
an der Nasenwurzel, sondern betriichtlich unterhalb derselben; ich habe z. B. bei 
Senoi von Ulu Gopei eine Entfernung von 7 bis 9 mm. zwischen diesen beiden 
Punkten feststellen kénnen.” 

The general tendency among continental authorities seems to be to make the 
upper point of reference on the flesh correspond to the nasion on the skull, which 
is defined as the naso-frontal suture. For example, Broca says ((1), p. 139): “La 
dépression transversale qui la sépare du neZ s appelle la racine du nez, ou potnt 
nasal ou nasion; elle correspond sur le squelette a la suture qui unit l’os frontal 
aux os nasaux.” Topinard also had definitely used the words ((30), p- 357) “as on 
the skull,” the implication clearly being some kind of correspondence between the 
points of reference on the flesh and on the skull. 

Hrdlitka ((5), p. 72) says: “The nasion should correspond as closely as possible 
to the anatomical nasion, i.e. the mid point of the naso-frontal suture.” 

Von Luschan ((11), p. 41) states: “Man fiihlt stets ee kleine Vertiefung die 
genau der Naht zwischen Nasenbein und Stirnbein entspricht.” He also notes: 
“Laien fehlen regelmiissig dadurch, dass sie die Nasenwurzel wesentlich tiefer 
suchen und etwa die Mitte der Linie zwischen den inneren Augenwinkeln zum 
Ausgangspunkt ihrer Messung machen wollen.” 


17. We thus find two sharply contrasted views about the location of the nasion, 
and hence about the upper point of reference in taking measurements of the nasal 
height. There is general agreement about the fact that the nasion on the skull is 
usually situated considerably higher up than the most depressed part at the root of 
nose on the the flesh. We have already seen that Martin found the difference as 
high as from 7 to 9 mm, in certain individuals. The anatomical work of R. Havelock 
Charles ((2), 1894) on 54 male and 8 female cadavers shows that the nasal height 
on the skull is almost certainly longer than the nasal height on the living. But 
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unfortunately Havelock Charles does not provide us with any statement of how he 
measured the nasal height on the cadaver. He talks of the visible “root” of the 
nose, but is the “root” of the nose the most receding point of the nasal bridge, 
when the head is adjusted to the Frankfurt horizontal, let us say? He does not 
write as if he had recognised that the height of the nose on the living had been 
measured in a variety of ways. When we come to Havelock Charles’s lower point 
of measurement we do not find that he has stated in the case of either skull or 
cadaver what he has taken for it. All we are told is that “Having carefully 
measured with sliding compasses, the nasal diameters of 62 ‘subjects,’ I removed 
the integuments, ete., and having cleared the naso-frontal suture and anterior nasal 
aperture, I again took the diameter *.” He does not tell us whether, in measurement 
on the subject, he took the sub-nasale, or the lowest point of the fleshy aperture, or 
whether if the former he applied pressure or not. Nor again are we told whether in 
measuring the nasal height on the skull he took as his lower point, the lowest 
point of the pyriform aperture, the base of the nasal spine, or the spine itself+. How- 
ever, Charles found the nose in the flesh to be less than the nose on the skull in height 
by —2to+16mm. It does not follow that this difference is due to the “root of the 
nose” not being the nasion, A portion of it depends on the depth of the sub-nasale 
being in the flesh below the nasal spine. Thus it seems probable that the difference 
between the two lengths is only a minimum difference of the distance from the 
nasion to the “root of the nose,” whatever the latter exactly connotes. We have for 
the 54 Panjabi males, Mean Difference = 4°33 mm. + °32, and for the eight (!) Panjabi 
females, 3:50 mm, +26. Thus it seems probable that the distance from nasion to 
root of nose is at least 45 mm. for males and 3°5 mm. for females of this race, and 
it may vary considerably with other races. As Havelock Charles emphasises nasal 
indices thus measured on the living and on the skull are not comparablet. 


18. It isa pity that Charles did not make any attempt to find out how accurately 
a point on the flesh could be located to correspond with the naso-frontal suture, To 
a layman it certainly appears doubtful whether this point can actually be determined 
with any degree of reliability. Hrdli¢ka ((5), p. 72) says in this connection: “Ina 
certain portion of subjects this point may be felt by the observer's finger nail or the 
point of a pencil$.”” But Hrdli¢ka immediately remarks: “ but in a majority we must 
rely on knowledge of its location derived from extensive observation on skulls and 
dissecting room material. It is always situated above a horizontal line connecting 
the two inner canthi.” Thus in a majority of cases (according to Hrdlitka’s own 
admission) the location of the point must be made by what is nothing else but 
guess-work, and in case of observers without any experience of the dissscting room, 
by perfectly random guess-work without any basis of previous knowledge. From the 

loc. cit. p. 2. Havelock Charles without definition uses the symbol », customary for microns, for 
millimetres. 


| He may have taken it again to the sub-nasal point, but this is not usual in measuring the nasal 
height of the skull. 


t A difference of 4:0 mm. in the mean will make a difference of more than 56 times the variance in 
(I - M1’)? for samples of 100 (with a mean §.D. of about 3-7 mm.). 


§ This is also the course recommended by Martin. 
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point of view of metrology an external or surface landmark, such as the most 
depressed part of the root of the nose, would appear to be far more satisfactory. 


19. Buxton (Maltese) followed the B.A. Report, and must therefore have adopted 
“the most depressed part at the root of the nose” as his upper point of reference *. 

Sarasin (New Caledonians) also has taken the same definitions. In his own work 
on Versuch einer Anthropologie der Insel Celebes (Wiesbaden, 1906), Zweiter Teil, 
p- 29, he says, referring to this question of the location of the nasion: “Ich befinde 
mich hier in einem Widerspruch mit Martin, welcher am Lebenden als Ausgangs- 
punkt die Stirn-Nasenbein-Sutur auszutasten sucht ; diese liegt aber gewdhnlich 
betrachtlich oberhalb der tiefsten Einsattelungsstelle der Nase. Ich halte das nicht 
fiir richtig, denn das physiognomische Nasenbild des lebenden Menschen wird, wie 
auch Martin ((15), p. 394) zugibt, die Einsattelungsstelle bedingt und nicht durch 
die Sutur am Schiidel, und dieses Bild der lebenden Nase ist es, was wir durch 
einen Index ausdriicken wollen. Auch hat es gar keinen Sinn, ein aus der Schiidel- 
anatomie gewonnenes Maass mit einem reinen Fleischteilmaass, wie es die Breite 
der Nasenfliigel ist, miteinander in einem Index zu vergleichen.” In his work on 
the New Caledonians ((22), p. 95) he says he has: “als obere Messpunkt fiir die 
Nasenhéhe die tiefste Einsattelungsstelle der Nase gewiihlt, um einen besseren 
Ausdruck fiir das Nasenbild des Lebenden zu gewissen.” 


It is however not clear what procedure Kubo (Koreans) has followed. Kubo 
constantly refers to Schmidt’s book (25) but unfortunately Schmidt is not very 
explicit on this point. Schmidt ((25), p. 107, No. 43) describes the “ Nasenwurzel” 
as “die tiefste Stelle der Einsattelung zwischen Stirn und Nase,” which apparently 
identifies the root of the nose with the most depressed part of the nose, but con- 
tinues in the very next sentence to say: “Man fiihlt hier bei feimer Haut die 
Knochensutur durch, welche die Nasenbeine vom Stirnbein trennt.” Which renders 
things quite inconclusive. Kubo also referred with approval to the following note 
by Baelz ((10), p. 253) about the difficulty of |ocating the root of the nose in east- 
Asiatic peoples: “Die Nasenwurzel hat beim Japaner meist eine andere relative 
Lage, als beim Europiier wir verstehen die tiefste Stelle des Nasensattels. Beim 
Luropiier ist diese Stelle nur wenig unterhalb der Verbindungsstelle beider 
Augenbrauen gelegen, beim Japaner dagegen ist der Unterschied sehr bedeutend, 
beim Europiier bildet der Nasensattel eine scharf markierte Stelle, eine Art Winkel, 
beim Japaner dagegen einen flachen Bogent.” This passage would suggest that 
Kubo was really seeking for the most depressed part at the root of the nose, and 
was therefore pointing out the difficulties encountered in locating this point for 
east-Asiatic peoples. On this interpretation his measurements would be comparable 
with those of Buxton and Sarasin but the legitimacy of such a comparison cannot 
be established with certainty. 


* Buxton appears to be unaware of or has neglected the divergence between the two authorities cited 
by him on different occasions, namely in (3) he cites Martin (14), but in the letter to Dr Morant already 
quoted on p. 3 above he cites the B.A. Report (21). 

+ For a careful study of the form of the nose see E. von Eickstedt (32). 
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F. von Luschan (Cretans) and Hrdlitka (Egyptians, Old Americans) on the 
other hand definitely adopted a different definition of the nasion on the living aud 
attempted to locate a point on the flesh which corresponded as closely as possible 
to the naso-frontal suture on the skull. 

It is possible that Koganei (Aino) also did the same thing, for although he does 
not mention anything explicitly in his paper on the Aino, in a later monograph on 
the Northern Chinese Soldiers ((9), p. 137) he distinctly states: “Als oberer Mess- 
punkt wurde nicht die tiefste Stelle der Einsattelung zwischen Stirn und Nase, 
welche wenigstens bei ostasiatischen Vélkern gewohnlich gar nicht der Stirn- 
Nasennaht entspricht, sondern einige mm. oberhalb derselben genommen, wo eben 
die Stirn-Nasennaht gelegen ist welche aber nur schwer durchzufiihlen ist.” 

For the Swedish data Lundborg and Linders mention Martin (14) in one place, 
and it is probable that Martin’s definition of the nasion was followed in this case. 
This hypothesis is supported by the following remarks ((13), p. 11): “.... The examiners 
often cannot locate the nasion (sutura naso-frontalis) with certainty.” In footnote 8, 
on p. 159, commenting on the Morphological Face Height we have: “ Hilden 
measured from the deepest part of the nasal root to the gnathion.” Comparing 
Hilden’s mean Morphological Face Height 123°1 (71 individuals) with Lundborg’s 
126°6 mm. (for 47,387 men) I find a difference of 3°5 mm. in favour of the latter, 
which supports the conjecture that the latter were trying to identify the nasion on 
the flesh with the naso-frontal suture on the skull*. There is another footnote, 
No. 10 on p. 159, “ Bryn located the nasion as a point 2 mm. above the deepest part 
of the nose+.” These footnotes suggest that Lundborg and Linders’ point of reference 
was different from those of Hilden and Bryn. 


20. We must now consider Shirokogoroff’s measurements. He at first uses the 
term “ Height of the Nose” ((27), p. 1), but ‘subsequently adopts ‘“ Length of the 
nose” and continues to use the latter name in most places. He is apparently not 
aware of the fact that these two names refer to two ditferent entities. The “ Nasal 
Length” is defined by the B.A. Report ((21), No. C. 3, p. 10) and Martin ((14), No. 23, 
p. 167) as the distance from the nasion to the tip of the nose, and is quite different 
from the “nasal height” which is measured to the sub-nasal point. 

A comparison of Shirokogoroff’s means with those of other authors shows that 
it is improbable that he actually measured the nasal height. For a series of 141 
Koreans, Shirokogoroff gives a mean “ nasal height” of 40°8 + °18 mm., while Kubo’s 
value is 49:0 + ‘09 mm. for 477 individuals. This gives a difference of 8°2 + °31, or 
a difference of about 26°5 times its own probable error. 

Again for Northern Chinese we have Koganei’s value ((9), p. 144) of 53°0 mm. 
for 968 individuals against Shirokogoroff’s of 41°8 for 391 individuals, giving a 
difference of 11°2 + 0°15 mm. or a difference of 80 times the probable error. Such 

* I should note however that Hilden’s measurements referred to the inhabitants of the island of 


Riino, which is believed to be populated by Swedes, while Lundborg and Linders’ measurements were 
all taken on persons of authentic Swedish extraction. 


So that we cannot be absolutely certain that the 
difference of 3°5 mm, was due to a diiference in technique and not to a difference of race. 


+ This footnote is interesting as giving still another definition of the nasion on the living. 
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huge differences are, of course, extremely unlikely to occtr in different samples of 
the same population, and they strongly suggest some fundamental differences in 
the technique. In fact for the two Korean series, for measurements which are known 
to be comparable, the next biggest difference is 0°81 mm. only for the Bigonial 
Diameter which is just over 52 times the standard deviation ; a big difference no 
doubt, but still of quite another order from the difference in nasal height. 

It is unlikely that differences as large as 82 and 11:2 mm. are due solely to a 
difference in the location of the upper point of reference, though that must be 
admitted as a possible explanation. The mean Panjabi difference found by Havelock 
Charles is 4°2 mm., but the form of the nasal bridge varies widely from race to race 
and the corresponding Chinese value may be much greater. 

The absolute magnitudes of Shirokogoroff’s “ nasal height ” are suspiciously low. 
For example for Koreans it is 40°8 mm. For certain other Chinese samples it is 
still lower, e.g. Kwantung (total) 40°0 mm., Kwantung (non-selected) 39°3 mm. The 
lowest value of the nasal height which I could find in standard literature was 
41°7 mm. for Kachin in Martin ((14), p. 451). 


21. The natural explanation suggested by Shirokogoroff’s frequent use of the 
term “length of the nose” would be that he had actually taken his measurements 
to the tip of the nose (instead of to the sub-rasal point) as his lower point of 
reference. This explanation is rendered all the more plausible by the peculiar 
history of the two names. 

In both the 2nd and 3rd editions of Notes and Queries published in 1892 and 
1899 respectively, as also in the B.A. Schedule of 1895, the names “ Length of the 
Nose” and “Nasal Length” (and not “ Nasal Height”) are used to denote the 
distance from the nasion to the sub-nasal point. Among British authorities, the 
name “nasal height” was used in its modern sense for the first time in the British 
Association Report of 1908. At the same time the name “nasal length” was also 
defined for the first time to denote the distance to the tip of the nose. Thus among 
British authorities “nasal length” would have one meaning up to 1908, and an 
altogether different meaning since that year. The same anomalous position had 
existed also in continental literature. In fact in earlier years the general tendency 
everywhere was to call the distance between the nasion and the sub-nasal point 
(and not the tip of the nose) the “nasal length.” For example Broca ((1), p. 182) in 
1879 calls it “Longueur du nez,” and Schmidt ((25), p. 108) in 1888 calls it “ Liinge 
der Nase.” On the continent, the International Commission of Monaco of 1906 
seems to have been the first authoritative body to introduce the modern name 
“nasal height” for this particular measurement. 

Koganei writing in 1893 naturally uses “ Nasenlinge,”’ while Kubo in 1918 
calls it “Hihe der Nase,” but adds the word “ Lange,” within brackets after “ Hohe,” 
evidently to link it up with the older name. Von Luschan in 1913, Sarasin in 1916, 
Buxton in 1922, and Lundborg and Linders in 1926 all use the modern name “nasal 
height.” Curiously enough Hrdlitka continued to use the older name “length” 
even in 1920, but he added the word “height” within brackets ((5), p. 74). 
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22. Summing up we find that the data considered here fall into three distinct 
groups : 

(i) Maltese, New Caledonians, and possibly Koreans, measured according to the 
British definition of the most depressed part of the nose as the upper point of 
reference. 

(ii) Cretans, Egyptians and Old Americans, and possibly the Aino and the 
Swedish data, measured according to a definition which makes the upper point on 
the fiesh correspond as closely as possible to the naso-frontal suture on the skull. 

(iii) The Mongolian data measured very likely to the tip of the nose instead of 
to the sub-nasal point. 

Members of each group are probably comparable with one another, but not with 
members of a different group. 


(7) Nasal Breadth. 


23. B.A. Report ((21), p. 10): No. C 2 “Nasal breadth. The greatest diameter, 
measured without pressure, between the wings of the nose.” This may be identified 
with Martin’s ((14), p. 162) No. 13 “Breite der Nase (Untere Nasenbreite, Nasen- 
fliigelbreite): Geradlinige Entfernung der beiden Alaria, d. h. der Punkte der 
gréssten seitlichen Ausladung der beiden Nasenfliigel voneinander.” Martin also 
mentions that the measurements must be taken without pressure (“ohne den 
geringsten Druck auf die Nasenfliigel”). Topinard had given the definition “the 
breadth from the widest portions of the alae” ((30), p. 357), and the Monaco Com- 
mission of 1906: “No. 13 Largeur du nez. Points anatomiques: face externe des 
ailes du nez” ((20), p. 571), and Hrdlitka ((5), p. 74) agree with the above definition. 

Measurements taken by Buxton (Maltese), Hrdli¢ka (Egyptians, Old Americans). 
Shirokogoroff (Mongolians), Sarasin (New Caledonians), and von Luschan (Cretans) 
are therefore probably comparable with one another. 

Koganei (Aino) and Kubo (Koreans) on the other hand call it “ Nasenbreite ” 
((8), p. 255, No. 15) and “Breite der Nase” ((10), p. 323, No. 35) respectively, and 
both define it as “der Abstand der Ansiitze der Nasenfliigel voneinander,” The use 
of the word “Ansatz” (onset, beginning) shows that this measurement is the same 
as Schmidt’s “ Breite der Nase” ((25), p. 108, No. 50): “an der Ansatzstelle der 
Nasenfliigel zu messen,” which may also be identified with Martin’s ((14), p. 163) 
No. 13a: “Grésste Breite am Hinterrand der Nasenfliigel, da, wo sie an der 
Wangenhaut festgewachsen sind.” Koganei and Kubo’s measurements while com- 
parable with each other are therefore not comparable with the first group *. 


(8) Nasal Index. 
24. Comparing the classification under (6) Nasal Height and (7) Nasal Breadth 
we get the following distinct groups for the Nasal Index: 
(i) Maltese, New Caledonians. 
(ii) Koreans. 


A difference of only 5 mm. will make a difference in (J/ — 1’) of 12 times the 8.D. in samples of 
100 (with mean 8.D.=2°9 mm.), 
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(111) Cretans, Egyptians, Old Americans. 
(iv) Aino. 


(v) Mongolians. 


Thus for such an important characteristic as the “Nasal Index” valid com- 
parison is not possible for the greater part of the data*. 


It has been shown above (Sections 6—8) that the nasal measurements of the 
living have been determined in a variety of different ways by different schools of 
anthropologists. The divergences dependent on that fact are evidently large 
compared with the inter-racial variations of the measurements, and there is no 
satisfactory method of adjusting the data so that they may be compared directly. 
The value of these statistics is greatly lessened owing to the deplorable lack of 
agreement between leading anthropologists of different countries. The nasal index 
of both the skull and the living head has been recognisea as a character which should 
be of great importance in distinguishing different races of man, and various attempts 
have been made to assemble the comparative material available. The earliest list 
of any permanent value appears to be that furnished by Topinard in 1885 ((29), 
p. 303). No references are given, but all the racial means may have been deter- 
mined in accordance with the French technique originated by Broca. Two years 
later Collignon published a more extended list, again without references}. It in- 
cluded means provided by Rudolf Virchow. The best known tables of living nasal 
indices of more recent date are those of Martin ((14), pp. 447—449), Thomson and 
Dudley-Buxtont and Deniker§. All agree in giving no references whatever. Of 
the three Deniker is the only one who comments on the fact that different methods 
of technique were used in determining the measurements. Martin’s list includes 
data provided by English (Duckworth, Beddoe, Thurston, Risley, Haddon and the 
Census of India||), French (Collignon, Deniker, Legendre, Mondiere and Chantre), 
and German (Schmidt, von Luschan, Fischer, K. Ranke and Martin) anthropologists 
and many others. Of the series compared in detail above Koganei’s Aino, Hrdli¢ka’s 
Egyptian and von Luschan’s Cretan means are quoted, and it was shown there that 
no valid comparison can be made between the Aino measurements on the one hand 
and the Egyptian and Cretan on the other. It would be easy to find other groups 
which cannot be legitimately compared. Thomson and Dudley-Buxton cite neither 


* Differences due to technique will be appreciable even in small samples. For example, with a mean 
§.D. of 7:97, a difference of 5 units in means will made a difference in (M—M’) of 4-4 times the S.D. of 
the difference in samples of 100. 

+ R. Collignon: La Nomenclature quinaire de l’Indice nasale du vivant. Revue d’ Anthropologie, 
Gime année, 3'*™me série, T. 2, 1887, pp. 8—19. 

{ Arthur Thomson and L. H. Dudley-Buxton: Man’s Nasal Index in Relation to Certain Climatic 
Conditions. Journal of the Royal Anthropological Institute, Vol. tam, 1923, pp. 92—122. The living 
nasal indices are given in Appendix I (pp. 116—119). 

§ J. Deniker: Les Races et les Peuples de la Terre, 2nd edition, Paris, 1926. The living nasal indices 
are given in Appendix III (pp. 718—721). 

| The Census of India data were obtained under Risley’s directions and it is probable that the French 
technique (as defined by Topinard) was used throughout. 
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references nor authorities for the material which they quote, but they certainly use 
the means given in the Census of India* and by Deniker, Delisle and Collignon. 
Possibly all the nasal indices given were found in accordance with the usual French 
technique, but the omission to supply any information relating to this vital matter 
is a regrettable one. Deniker (op. cit. pp. 98—99) comments on the fact that 
German and French anthropologists have determined nasal measurements in 
different ways and he says that all the means given in his list “ont été prises 
d’apres la méthode Broca-Collignon.” On turning to his table, however, we find 
several values given by E. Schmidt, Fischer and Sarasin. Of the series compared 
in detail above, Sarasin’s New Caledonian, Hrdli¢ka’s Egyptian and von Luschan’s 
Cretan means are given. It has been shown that the comparison of the first with 
the last two is quite invalid. We are obliged to conclude that no single one of these 
lists can be considered to have any statistical value until it has been carefully 
revised or confirmed. It is extraordinary that the anthropologists who prepared 
them should have been so oblivious to the demands of empirical science. 


(9) Nasal Depth. 


25. B.A. Report ((21), p. 10): “No. C2 Nasal depth. From the sub-nasal point 
to the most projecting point of the tip of the nose.” Martin ((14), p- 167) gives 
practically an identical definition—No. 22 “Linge des Nasenbodens (Tiefe der 
Nase, filschlich Hiéhe der Nase, Nasenelevation: saillie de la base du nez; nasal 
depth): Projektivische Entfernung des Subnasale vom Pronasale.” The Monaco 
Commission, No. 14 “Saillie de la base du nez,” also gives the same definition 
((20), p. 571): “Points anatomiques en avant: le point le plus saillant du lobule 
nasal; en arriére: le point ot le plan médian est coupé par la ligne transversale 
joignant le point le plus reculé de chacun- des plis naso-labiaux. Prendre la 
distance en projection de ses deux points avec un instrument approprié.” 

Both Koganei ((8), p. 255, No. 13) and Kubo ((10), p. 331, No. 36) define it as 
“die Prominenz der Nase vom Gesicht, gemessen vom Ansatze der Nasenschiede- 
wand an der Oberlippe bis zur Spitze,” and their measurements are comparable. 

26. We again notice a terminological ambiguity. Koganei calls this measure- 


ment “Nasenhéhe,” while Kubo calls it “Tiefe der Nase,’ but adds the word 
“ Hohe” within brackets. 


(10) Morphological (Total) Face Height. 


B.A. Report ((21), p. 9): “No. B 2 Total Face Height. From the nasion to 
the lower edge of the point of the chin. A contact measurement*.” Buxton calls 
it the “total facial heightt.” The Monaco Commission, No. 9 “Diamétre naso- 


- 


27. 


* The Census of India data were obtained under Risley’s directions and it is probable that the French 
technique (as defined by Topinard) was used throughout. 

+ A contact measurement is one to be taken without pressure. 

+ I may note in passing that this same measurement is called “Length of Face” in Notes and 
Queries, 1874 ((16), p. 4, No. 41), 1892 ((17), p. 24) and 1899 ((18), p. 24). But the name is changed to 
total facial height” in the 1912 edition ((19), p. 7). 
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mentonnier*. Méme technique que sur le squelette, en pressant un peu comme 
pour le précédent. Chercher le nasion en remontant avec l’ongle le dos du nez 
jusqu’au léger ressaut que fait le bord inférieur du frontal” ((20), p. 570). The 
anatomical points for the craniological measurements are defined, “en haut: nasion; 
en bas: bord inférieur de la mandibule, dans le plan médian” ((20), p. 564, No. 11). 


Hrdli¢ka calls it the “chin-nasion height” in his work on the Egyptians ((4), 
p- 61), and describes it as “the distance from the lowest point of the chin in the 
median line to a point corresponding to the middle of the fronto-nasal articulation.” 
In his later work on Anthropometry ((5), p. 72) he calls it the “Anatomical Face 
Length” and defines it as “the distance from the menton (the lowest point in the 
middle of the bony chin) to the nasion.” 


It may be identified with Martin’s ((14), p. 165) No. 18 “Morphologische 
Gesichtshéhe (Nasomentale Gesichtshéhe; Kieferhshe; hauteur naso-mentonniére; 
total face length): Geradlinige Entfernung des Nasion vom Gnathion” and with 
Schmidt’s ((25), p. 107) No. 43 “der Abstand des Kinnes von der Nasenwurzel.” 

Koganei calls it “Gesichtshéhe B” after Virchow, and describes it as the chin- 
nasion distance: “die Entfernung der Nasenwurzel (Stirn-Nasennaht) vom unteren 
Kinnrande” ((8), p. 254). Kubo uses the name “Morphologische Gesichtshéhe” 
((10), No. 21) but gives the same definition as Koganei. Von Luschan calls it the 
“Nasenwurzel-Kinn” distance without giving further details. Sarasin uses the 
name “Nasomentale Héhe,” and as he usually follows Martin, his measurement 
may be taken to be the same as No. 18 of Martin. Shirokogoroff uses the name 
“Anatomical Face Length”; I have therefore identified his measurement with this 
present one which I am calling morphological (or total) face height. 


28. It will be noticed that the lower point of reference is spoken of indifferently 
as the “lower edge of the chin,” “the middle of the bony chin,” “the gnathion,” 
etc., and while there is general agreement about the region concerned, I have no 
information to judge whether they all refer in actual practice to the same identical 
point. As usual the authorities either give no directions at all, or else inadequate 
directions, regarding the point whether pressure is to be used or not in locating 
the gnathion and, if so, what degree of pressure. But assuming that there is no 
ambiguity about the lower point of reference it is clear that the same confusion 
exists about the upper point as in the case of the nasal height. I believe we may 
safely assume that each observer will stick to his own particular definition of the 
“nasion” in taking both measurements (i.e. “the nasal height” and “ the morpho- 
logical (total) face height”). On this assumption and ignoring the important 
question of pressure we obtain two distinct groups: 

(i) Maltese, New Caledonian, Mongolian*+, and possibly Korean. 

(11) Cretan, Egyptian, Old American, and possibly Aino and the Swedish data. 

* This is of course quite a different measurement from No. 8 “ Hauteur totale du visage” which 


corresponds to the Physiognomic Face Length. 
+ Shirokogoroff (Mongolian) may now be classed with Buxton and Sarasin. 
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It will perhaps be moderately safe to compare measurements within the same 
group, but a comparison between scries in different groups would be quite 
invalid *, 

(11) Bizygomatic Breadth. 

29. B.A. Report ((21), p. 9) calls it No. b 5 “Maximum interzygomatic breadth” 
and defines it as the “maximum diameter between corresponding points on the 
opposite zygomatic arches,” and notes that “the pressure used is to be as much as 
can be comfortably borne by the person under examination.” Buxton calls it 
“Bizygomatic Breadth” and Shirokogoroff uses the name “ Interzygomatic 
Breadth.” 

The Monaco Commission, No. 6 “Diamétre Bizygomatique. Méme technique 
que pour le crine. Chercher avec soin le maximum qui est souvent plus en arriére 
qu’on ne le suppose” ((20), p. 570). The craniological definition specifies the 
anatomical points as “face externe des apophyses zygomatiques” ((20), p. 564, No. 8). 

Hrdliéka calls it the “maximum bizygomatic diameter” and defines it as the 
distance between “the most widely separated points on the external surface of the 
zygomatic arches,” which is virtually the same as the definition of the Inter- 
national Commission’s No. 6. 

This measurement corresponds to Koganei’s “Gesichtsbreite a” which he 
defines as “der grisste Abstand der Jochbogen voneinander” ((8), p. 254). Kubo 
gives an identical definition but uses the name “Jochbogenbreite,” which is also 
the name used by Martin ((14), p. 160), No. 6 “Jochbogenbreite (oft falschlich Joch- 
breite genannt, Gesichtsbreite A nach Virchow; largeur totale de la face ou 
distance bi-zygomatique ; maximum interzygomatic breadth): Geradlinige Ent- 
fernung der beiden Zygia, d. h. der am meisten seitlich vorstehenden Punkte der 
Jochbogen voneinander.” ; 

Von Luschan and Sarasin both have measurements under the name “ Joch- 
breite.” From the above-mentioned remark of Martin that “Jochbogenbreite” is 
often wrongly called “Jochbreite,’ and from the fact that both von Luschan and 
Sarasin use this measurement to calculate the Facial Index, we may conclude that 
their “Jochbreite” is the same as our “ Bizygomatic Breadth.” 

30. A difficulty remains about the use of pressure in taking measurements. 
The B.A. Report alone gives instructions on this point, all other authorities 
remaining silent. In the absence of any information as to pressure we may, 
perhaps, assume that all the measurements are comparable, but it must not be 
forgotten that there is a possibility of a statistically significant error being intro- 
duced when we make such an assumption. 


(12) Morphological Facial Index. 
31. The Morphological Facial Index is defined as the ratio of the Morphological 
Face Height to the Bizygomatic Breadth multiplied by 100. The same comparisons 


* With a mean §.D. of 6-4 mm. and a difference of 4 mm. (Havelock Charles’s value for the difference 


between the two definitions of the nasion), we get a difference of 22 times the variance in (M-M ’)? in 
samples of 100, 
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will therefore be possible as in the case of the Morphological Face Height. That 
is, we have two distinct groups, and this only if we neglect any pressure difference: 
(i) Maltese, New Caledonian, Mongolian, and possibly Korean. 


(il) Cretan, Egyptian, Old American, and possibly Aino and the Swedish data. 


(13) Upper Face Length. 


32. B.A. Report ((21), p. 9): No. B 1 “Upper Face Length from the nasion to 
the edge of the gum between the two upper central incisor teeth. A contact 
measurement,” which may correspond to Martin’s ((14), p. 165) No, 20 “Morpho- 
logische Obergesichtshéhe (diamétre nasio-alvéolaire, upper face length): Geradlinige 
Entfernung des Nasion vom Prosthion,” and to the International Commission’s 
No. 11 “Diamétre nasio-alvéolaire.” Same technique as on crania for which the 
anatomical points are defined: “en haut: nasion; en bas: le point le plus inférieur 
du bord alvéolaire, entre les deux incisives médianes et supérieures ” ((20), p. 565, 
No. 12). The “Upper Face Length”* of Buxton (Maltese) and Shirokogoroff 
(Mongolian) are therefore comparable t. 


33. Koganei ((8), p. 254) has one measurement which he calls the “Mittel- 
gesichtshéhe” after Virchow, and defines as “die Entfernung der Nasenwurzel von 
der Mundspalte (bei geschlossenen Munde).” Kubo ((10), p. 259) gives an identical 
definition, and his measurements are therefore comparable with Koganei’s. This 
measurement corresponds to Martin’s ((14), p. 165) No. 20 “Physiognomische Ober- 
gesichtshihe (Mittelgesichtshohe nach Virchow, diamétre nasio-buccal): Geradlinige 
Entfernung des Nasion vom Stomion,” and to the International Commission’s No, 10 
“Diamétre naso-buccal. Points anatomiques dans le plan médian: en haut: nasion; 
en bas: interligne des lévres” ((20), p. 570). This measurement is therefore quite 
different from the “Upper face length” of the B.A. Report. The measurements of 
Koganei ‘Aino) and Kubo (Korean) while comparable with each other are not 
therefore comparable with those of Buxton (Maltese) or Shirokogoroff (Mongolian). 


(14) Minimum Frontal Diameter. 


34. B.A. Report ((21), p. 2): No. A 3 “Minimum Frontal Diameter. From one 
frontal crest to the other across the narrowest part of the forehead.” Buxton, 
Shirokogoroff and Hrdli¢ka use the same name, and the latter defines it ((5), p. 73) 
as “the shortest horizontal diameter between the two temporal crests,” which is 
also virtually the same definition as that given for the International Commission’s 
No. 4 “Largeur frontale minima” ((20), p. 570), the technique being the same as 


* The use of the word “‘ length” in connection with this measurement is a little inconsistent, for the 
word “ height” is used for an exactly analogous measurement, namely the ‘‘ morphological (total) face 
height.” 

+ We have here a good illustration of the confused terminvlogy at present in use. We speak of 
Morphological Face Height and Upper Face Length! It would be well to keep the term Height for 
projected lengths taken in the vertical, i.e. perpendicular to the standard horizontal plane. 
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on the skull: “C’est le diamétre horizontal le plus court entre les deux crétes 
temporales du frontal” ((20), p. 564, No. 5). 

It corresponds to Martin’s ((14), p. 159) No. 4, “Kleinste Stirnbreite: Gerad- 
linige Entfernung der beiden Frontotemporalia voneinander.” Von Luschan uses 
the same name “Kleinste Stirnbreite,” as also Koganei ((8), p. 254, No. 4) and 
Kubo ((10), p. 240, No. 19), both of whom define it as “der kleinste Abstand der 
Schlafenlinien am Stirnbein.” 

I should note here that Martin explicitly states that the instrument should be 
placed in such a way that no pressure is exerted (“ohne zu driicken”), while all other 
authorities are silent on this point. I da not know what difference will be made 
by the application of different degrees of pressure, but if we could neglect such 
differences then all the measurements would be comparable. 


(15) Bigonial Diameter. 

35. B.A. Report ((21), p. 9): No. B8 “Gonial Breadth. The diameter between 
the extreme outer points of the angles of the lower jaw. The measurement is to be 
taken with the maximum comfortable pressure.” Martin’s definition ((14), p. 161) is: 
“No. 8 Unterkieferwinkelbreite (Untergesichtsbreite, uniere oder mandibulire 
Gesichtsbreite; largeur mandibulaire, bigoniaque ou bigonial; bigonial breadth) : 
Geradlinige Entfernung der beiden Gonia voneinander. Die Tasterspitzen sind 
nicht hinten an die Unterkieferwinkel, sondern vielmehr etwas seitlich unmittelbar 
oberhalb des Randes aufzusetzen, damit die aussere seitliche Ausladung der 
Unterkieferwinkel, die sehr stark sein kann, mitgemessen wird.” Nothing is said 
of the degree of pressure to be used in taking the measurement but, otherwise, it 
appears to be comparable with the B.A. “gonial breadth” and with the “diamétre 
bigonique” of the International Commission ((20), p. 570, No. 7). 

Buxton and Shirokogoroff call it “Bigonial Breadth,” while Hrdlitka calls it 
“Bigonial Diameter” and defines it ((5), p. 73) as the distance between “the gonions 
or points of the angles of the lower jaw.” 


Kogane* (5), p. 254, No. 10) calls it “Gesichtsbreite C,” and defines it as “der 
Abstand cer Kieferwinkel voneinander.” Kubo ((10), p. 279, No. 27) uses the name 


“T's terkieferwinkelbreite” and describes it as Virchow’s “Gesichtsbreite C.” Von 
iwaschan calls it “ Kieferwinkelbreite.” 

It is by no means clear that all these measurements were taken between the 
outer points of the angles as Martin directs and, indeed, all the definitions of the 
gonia of the living face that have been given are lacking in precision and they might 
be interpreted in different ways by different observers. Another source of uncertainty 
is introduced by the absence of instructions regarding the question of pressure and 
for this measurement that is a vital point. No pair of these bigonial diameters can 
be compared without making assumptions which may be unjustified. 


(16) Internal Ocular Breadth. 
36. B.A. Report ((21), p. 9): No. B7 “Internal Ocular Breadth. The diameter 
between the two internal canthi of the eyelids. A contact measurement.” It adds 











P. C. MAHALANOBIS 23 


the direction that the measurement is to be taken when the eyes are open. Buxton 
who uses the same name, and Shirokogoroff who has one measurement called 
“Tnternal Interocular Breadth,” have probably followed the above definition which 
is virtually the same as the International Commission’s No. 16 “Largeur bipalpé- 
brale interne. Points anatomiques: angle interne de chaque ceil, sans s’occuper de 
la caroncule” ((20), p. 571). Martin ((14), p. 161) gives it as No. 9 “Breite zwischen 
den inneren Augenwinkeln (Nasenwurzelbreite, obere Nasenbreite; largeur inter- 
oculaire, bi-caronculaire, bi-angulaire interne, bi-oculaire interne ou bi-palpébrale ; 
internal ocular or interocular breadth): Geradlinige Entfernung der beiden 
Entokanthia voneinander bei offener Lidspalte.” Koganei ((8), p. 255, No. 11) 
describes it as “Distanz zwischen den inneren Augenwinkel” which is probably 
the same as the above described measurement. 


37. Kubo on the other hand is a little uncertain. He calls it ((10), p. 295, No. 30) 
“Breite zwischen den inneren Augenwinkeln (Interocularabstand)” which would 
apparently identify his measurement with the above-mentioned sets. But in his 
description he distinctly mentions the “Abstand der inneren Orbitalriinder des 
knéchernen Schiidels” which is taken verbatim from Schmidt ((25), p. 106, No. 38). 
This would imply that he was actually measuring the distance between the two 
internal edges of the orbital ridge, which distance of course may differ a little from 
the distance between the two internal canthi. Comparison with other sets is there- 
fore a little doubtful. 


(17) Haternal Ocular Breadth. 


38. B.A. Report ((21), p. 9): No. B6 “External Ocular Breadth. The diameter 
between the two external canthi of the eyelids. A contact measurement.” Measure- 
ment to be taken when the eyes are open. 

Shirokogoroff who calls it ((27), p. 1, No. 19) “External interocular breadth” has 
probably followed the above definition. Koganei describes it ((8), p. 255, No. 12) as 
“Distanz der diusseren Augenwinkeln,” and his measurements are probably compar- 
able with those of Shir Ikogoroft. 

The above measurement corresponds to Schmidt's ((25), p. 106) No. 37 “Breite 
zwischen den beiden iiusseren Augenwinkeln,” to the International Commission’s 
No. 15 “Largeur bi-palpébrale externe. Points anatomiques: angle externe de 
chaque ceil, dans sa région profonde, en contact immédiat avec le globe de l’ceil. 
Les yeux du sujet étant bien ouverts, le regard un peu au-dessus de horizon, viser 
ce point avec les branches du compas appuyées sur les joues du sujet” ((20), p. 571), 
and to Martin’s ((14), p. 161) No. 10 “Breite zwischen den ausseren Augenwinkeln 
(Obergesichtsbreite nach Weisbach; largeur bi-oculaire externe, bi-angulaire, bi- 
palpébrale externe; external biocular breadth): Geradlinige Entfernung der beiden 
Ektokanthia voneinander bei offener Lidspalte.” 


39. Kubo is again doubtful. He calls it ((10), p. 301, No. 31) “Breite zwischen 
den ausseren Augenwinkeln,” but actually defines it as “Breite zwischen den 
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aiusseren knéchernen Augenhdhlenrindern.” This of course is a quite different 
measurement corresponding to the B.A. Report ((21), p. 9) No. B5 “External 
Orbital Breadth. Maximum diameter between the outer margins of the orbital 
openings,” to Schmidt’s ((25), p. 106) No. 36 “Breite zwischen den iusseren 
knéchernen Augenhéhlenrindern,” and to Martin’s ((14), p. 162) No. 10 (1) “Breite 
zwischen den aiusseren Augenhéhlenriindern (diamétre orbitaire ; external orbital 
breadth): Geradlinige Entfernung der beiden fiusseren Orbitalrinder im Niveau 
der tiusseren Lidkommissar voneinander.” Kubo’s measurement* cannot therefore 
be compared with either Shirokogoroff’s or Koganei’s, but it may be compared 
with Buxton’s who definitely took the “External Orbital Breadth” (B5 of the 
B.A. Report) and not the “External Ocular Breadth” (B 6). 


(18) Breadth of the Mouth: 


40. Hrdlicka defines it as the “distance between the angles of the mouth at 
points where the mucous membrane joins the skin, with mouth naturally closed, 
without tension” ((5), p. 75), which is practically a literal translation of the Inter- 
national Commission’s No, 17 “Largeur de la bouche. Points anatomiques: com- 
missures des lévres, au point ot la muqueuse se continue avec la peau; prendre leur 
distance, la bouche étant dans sa position moyenne” ((20), p. 571). 


Koganei ((8), p. 255, No. 16) calls it “Liinge des Mundes” but does not give any 
definition, Kubo calls it ((10), p. 339, No. 37) “Breite (Linge) des Mundes,” and 
notes that it was measured with the mouth closed. The addition of the word 
“Linge” within brackets shows that it was formerly known by that name. It 
corresponds to Martin’s ((14), p. 163) No. 14 “Breite der Mundspalte (Mundlinge; 
longueur buccale): Geradlinige Entfernung der beiden Cheilia voneinander. Der 
Mund muss geschlossen und in Ruhelage sein.” All the measurements are therefore 
comparable. 


(19) Length of the Ear. 


41. B.A. Report ((21), p. 10): D 2 “Greatest length of the ear. From the highest 
to the lowest point of the auricle.” Shirokogoroff uses the same name. 

Monaco Commission: No. 19 (a) “Oreille—Longueur maxima: Points anato- 
miques: en haut: le point le plus élevé du bord de l’hélix; en bas: extrémité 
inférieure du lobule. Placer la tige du compas paralléle au grand axe de loreille 
et ses branches tangentes aux points indiqués, sans presser” ((20), p. 572). 


Hrdlicka called it “Height of the Kary” in his paper on the Egyptians ((4), 
p. 91), and stated that his measurement was the same as that of Topinard ((29), 
1885, p. 1004), Weisbach (Zeit. fiir Hthnologie, Bd. 1x, suppiement, 1878), and 

* This is another example of a measurement being called by a name properly belonging to an 
altovether different measurement. 


+ It would be well to retain this term for the projection of the maximum length on the vertical, i.e. 


when the rod of the callipers is held perpendicular, with the head adjusted to the standard horizontal 
plane. 
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Schwalbe (Beitriige zur Anthropologie des Ohres—Virchow’s Festschrift, 1891, p. 95). 
Schwalbe himself called it “Grisste Liinge des Ohres” ((26), pp. 115—114) and 
mentioned that “dieses Mass ist identisch mit dem, welches von Weisbach und 
Virchow als Héhe des Ohres bezeichnet wird. Die grésste Linge des Ohrscheitels 
bis zum entferntesten Punkt des unteren Ohrrandes.” Comparing Schwalbe’s 
diagram ((26), p. 115, Fig. 10, ab = LZ) with that of the B.A. Report (fig. 1, AB), 
I find that the two definitions are identical. 





42. In his later book ((5), p. 76) Hrdli¢ka changed the name to “Maximum 
Length” and defined the landmarks as follows: “Superiorly the highest point on 
the border of the helix; inferiorly the lowest point on the lobule. The rod of the 
compass should be held parallel to the long axis of the ear; use no pressure.” This 
is again practically a translation of the International Commission’s No. 19 (a). 

43. Koganei calls it ((8), p. 255, No. 17) “Liinge des Ohres,” and defines it: 
“vom unteren Ende des Ohrlaippchens bis zum hochsten Punkte der Ohrmuschel 
gemessen.” 

Kubo calls it ((10), p. 349, No. 39) “Physiognomische Linge des Ohres (Hohe 
des Ohres)” and following von Luschan ((11), p. 40, No. 54) defines it as “dessen 
wirkliche Héhe zwischen den wagerechten Tangenten an dem oberen und dem 
unteren Rand.” The measurement corresponds to Martin’s ((14), p. 168) No. 29 
“Physiognomische Liinge des Ohres (grisste Linge des ganzen Ohres nach 
Schwalbe): Geradlinige Entfernung des Ohrscheitels (Superaurale) von dem 
tiefsten Punkte des Ohrlappchens (Subaurale).” Probably all the measurements are 
comparable. 


IlI. THe CoMPaARATIVE VALUE OF THE MATERIAL, 

44. It may be presumed that the chief, if not the only, reason for taking 
anthropometric measurements is to use such measurements for purposes of com- 
parison, that is: for the study of racial or groun resemblances and divergences. 
The real value of any particular collection of material would then be determined 
by the possibilities of comparison offered by it. Any portion of the material which 
is not available for comparison may therefore be considered useless for our purposes. 

45. Let us consider the present material. Counting Shirokogoroff’s material 
as one single unit we have nine distinct sets of observations taken by nine different 
observers. For any particular character for which comparable measurements are 
available for every set, $(9 x 8)=36 different comparisons will be theoretically 
possible. Owing to lack of agreement in definitions or technique all measurements 
however may not be comparable, in which case the number of comparisons actually 
possible will be considerably reduced. For example, for the Nasal Index we have 


the following groups: 
(i) Korean (Kubo) 
(ii) Aino (Koganei) 
(iii) Mongolian (Shirokogoroff) 
(iv) Swedish (Lundborg) 


Single sets, no comparison possible. 
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(v) Maltese (Buxton) 1 7 : ‘bl 
“pees ae . . | Two sets, 1 comparison possible. 
New Caledonian (Sarasin) ( ‘ P I 
(vi) Cretan (von Luschan) 
Egyptian (Hrdlicka) Three sets, 3 comparisons possible. 
Old American (Hrdli¢ka) 


Thus only 4 (out of a possible 36) comparisons are feasible in practice. We 
may reckon the effective value of the material to be 4/36 or about 11°/, only. 

I give below, Table I, such detailed analysis for each character separately. 
Col. (3) gives the total number of sets for which a particular measurement is 
available, Col. (4) the number of comparisons which would have been possible if 
all measurements had been strictly comparable, Col. (5) the number of comparisons 
which might be reasonably made in practice, though even then they include 
several—such as those between the bigonial diameters (see Section (15) above)— 
which could only be justified fully if far more adequate information were available 


TABLE I. 








| (1) | (2) | (3) (4) | (5) (6) (7) | (8) 
rr | —_| —;—— 

*1 | Head Length Ree a ne ee 9 25 12 | 33 
*2 | Head Breadth a ae a. oe ee 9 25 16 | 4 

| *3 | Cephalic Index... ‘ea we te 36 9 25 12 33 
4 | Height of Head |. 5 10 | 2! 2% 2| 2 

5 | Horizontal Circumfere nee of He ad | 2 l | l 100 1 | 100 

*6 | Nasal Height ae - cms 9 36 { 1] 13 | 3 
*7 | Nasal Breadth... am sé 9 36 16 44 | 2 61 
*8 | Nasal Index ase et wind 9 36 4 1] 4 | 11 

9 | Nasal Depth ‘ ww i Sta 4 100 1 | 100 
*10 | Morphological Face Height — ... 9 | 36 6 17 16 | 4% 
*11 | Bizygomatic Diameter . nee 9 36 36 100 | 36 | 100 
*12 | Morphologic: 11 Facial Index... 9 36 | 6 17 16 | 44 
*13 | Upper Face Length | 4 6 2 33 2 | 33 
14 Minimum Frontal Diameter 8 8 | 28 100 28 100 

15 | Bigonial Diameter P ai 7 21 | 2 100 21 100 

16 | Internal Ocular Breadth .. 5 10 6 60 6 60 
*17 External Ocular Breadth 4 6 | 2 33 2) 33 
18 | Breadth of Mouth... ) 10 10 100 10 | 100 

| 19 Length of Ear 5 10 10 100 10 | 100 
Total sins ae wae aoe 427 | 182 12°6 230 53°8 





regarding the definitions of points and measurements. Lacking any direct negative 
evidence, it has been assumed that the different workers used the same degree of 
pressure in determining a particular measurement. If it were demanded that the 
comparison should be perfectly rigorous, so that the absence of information on 
such a point would necessitate the rejection of the observation, then the strictly 
comparable material would be reduced almost to none. Col. (6) is the percentage 
effective value (or efficiency) given by the ratio of (5) to (4) expressed as a 
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percentage, Col. (7) and Col. (8) corresponding numbers if all doubtful cases are 
included as being available for comparison. 

Adding up for all characters we notice that only 182 comparisons can be made 
with any degree of certainty* out of a possible 427, giving an effective value of 
42°6°/,. If doubtful cases were included the actual number and percentage would 
be increased to 230 and 53°8°/, respectively. The above analysis does not however 
give a fair idea of the real position, for in the above list we have included 
number of characters which are rather rare. 

46. Let us confine ourselves to the standard list given in Notes and Queries (19). 
It consists of 9 measurements of the head, out of which we have 8 in the above 
list. Including the three relevant indices we obtain a total of 11 measurements 
(marked with an asterisk in Table I) for which we now get Table II. 


TABLE II. 


Number of Comparisons 





Actually possible 


| Theoretically |—— = ———— 

| possible | Rejecting pr tful | Including doubtful 
| cases | cases 

& cesar, [9 Seams = ee Aa 
bam 336 | 103 | 151 

| Siena age 30°6 45-0 





If we further omit the two characters Nos. (13) and (17) in Table I, i.e. restrict 
ourselves to the characters which are available in every set, we get the following 
table. 

TABLE IIL. 


Number of Comparisons 





| Actually possible 
Theoretically 
possible 











a : 
| 
; Rejecting doubtful Including doubtful 

cases cases 

a ee ee ee ee ee 

| 324 | 99 147 

= es Se ee! eee a 

Percentage | 30°6 | 15°4 

The appalling fact then comes out that only from 30°/, to 45°/, of the 


theoretically possible comparisons can actually be m: ade in practice, i.e. fully from 


* Even this assumes that the same amount of pressure has been used by different observers in 
measuring Morphological Face Height, Bizygomatic Diameter, Minimum Frontal Diameter and 
Bigonial Diameter. If different degrees of pressure were employed, and such differences cannot be 
legitimately ignored, then the actual number of possible comparisons is reduced to 85 or to less than 20°/, 
of the number theoretically possible. Not neglecting pressure differences the possible comparisons in 
Table II are 55 or 16°4°/, and in Table III, 51 or 15:7°/,! 
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55°/, to 70°/, of the usefulness of the material is wasted owing to lack of agree- 
ment in definitions and technique. A more exacting critic might be unwilling to 
make some of the assumptions which we have made in order to obtain these 
figures. If he demand that comparison should be restricted to measurements 
certainly known to have been found by identical methods then he would find that 
only 20°/, or even 15°/, of the possible comparisons could be made legitimately. 
To put it in a different way the effective value of the present material might have 
been increased at least two- or three-fold and possibly five- or six-fold by proper 
standardisation without any additional expenditure of energy, time or money in 
collecting the material. 


47. I may note here that the present material was originally collected with a 
view to obtaining as large a number of comparable characters as possible. In fact 
I rejected several series of observations where a preliminary examination showed 
that a sufficient number of comparable characters would not be available. Thus 
my very generous estimate of a wastage of from 55°/, to 70°/, is more likely to be 
an under-estimate than otherwise for measurements in general. 

The actual situation is indeed much worse owing to another reason. Ordinarily 
one gets from 10 to 15 different measurements (on the head) in good sets of 
observations, and only rarely 20 or more. So that owing to the reduction in 
number of measurements due to lack of standardisation the number of characters 
actually available for comparison may be easily reduced to anything from 5 to 8 on 
an average. But for serious comparative studies it is absolutely essential to have 
a larger number of characters, preferably 20 or more. Thus the reduction in 
number of available characters, owing to want of standardisation, will very often 
render the whole material practically useless for comparative purposes. 

We are obliged to conclude that even from the best of the series of head 
measurements at present available we are quite unable to establish reliable inter- 
racial comparisons. The labour of estimating how far the measurements may be 
legitimately compared is not one from which the biometrician is likely to derive 
much profit, for he will usually be left with insufficient material to serve his needs. 
The comparisons frequently made between similar measurements taken according 
to different methods of technique are merely idle. 


[V. Conc.usions. 

48. The present study indicates that the greater part of the existing anthro- 
pometric measurements on the living are probably useless for comparative purposes 
owing to lack of agreement in definitions and technique. 

Such lack of standardisation is due to the existence of a multiplicity of “standard 
lists” each with its own adherents. The International Agreement of 1906 has 
failed to achieve any appreciable degree of uniformity among anthropometric workers. 


Out of eight observers considered in this paper (all of whom carried out their present 
investigations after 1906) only two (Hrdli¢ka and Shirokogoroff) mention the 1906 
Agreement, but both of them introduce departures from this Agreement. Several 
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important standard lists, e.g. the British Association Report (1908), Notes and 
Queries (1912), Martin’s list (1914), were drawn up after the International list, but 
have made substantial departures from it. 


The failure of the International Agreement of 1906 may be ascribed to several 
causes. Specifications are in many places vague and uncertain, detailed descriptions 
are not often given about the actual procedure to be followed in taking measure- 
ments, and the list contains too few characters. Finally it does not appear to have 
been readily available in a convenient form. 


It is hardly necessary to point out that no progress in comparative Anthro- 
pometry will be possible until arrangements have been made for securing adequate 
uniformity of standards among different observers. The only way in which this can 
be achieved is to convene another International Conference for drawing up a more 
satisfactory set of standards than the Monaco (1906) Agreement. 


The greater part of this paper was written in the Biometric Laboratory, Uni- 
versity of London, and I have to thank Prof. Karl Pearson for his active interest 
and valuable suggestions at every stage of my work. I also acknowledge with 


gratitude the help I have received from Dr G. M. Morant of the same laboratory. 


APPENDIX I. 


The following table gives a rough idea of the order of the difference made in 
Z=(M — M’)/>(4-w for certain estimated values of the difference in means arising 
from differences in technique. Col. (3) gives the total number of individual measure- 
ments (taken from the data considered in the present paper) on which the average 

















j | | | 
| | (3) | (4) ¢ (5) (6) 
(1) | (2) we ot Mean | Estimated | ‘ Z” for 
No. | Name of character Suciteianais Standard difference } samples 
| | | ~~ | deviation in Means of 100 
| | | 
a eee 
1 | Head Length a — = | 1863 692mm. | 3mm. 3-0 
a | Head Breadth 1864 } 5°72 mm. 3 mm. 3°7 
3 | Cephalic Index 1863 4:20 0-4 0-7 
4 | Height of Head 795 6°13 mm. 3 mm. 3°4 
5 | Horizontal Cire umference of Hea ad | 640 | 13°25 mm. 
6 | Nasal Height au ee ce | 1856 | 374mm. | 4mm. 75 
7 | Nasal Breadth... Sc ee 1852 |} 293mm. | 5mm. 12°1 
8 Nasal Index = ree one | 1850 | 7:97 5°0 4-4 
9 Nasal Depth ‘ neni 440 | 1°95 mm. 
10 Morphologic: ul Face » Height oe 0 1852 6°41 mm. 4mm. 4-4 
11 Bizygomatic Diameter : ar 1855 | 545 mm. 
12 | Morphological Facial Index... | 1847, | 4°72 30 5°7 
13 Upper Face Length - | 592 | 3°81 mm. 5 mm. 92 
14 Minimum Frontal Diameter ; 1612 4°96 mm. 
15 Bigonial Diameter : | 1620 6°07 mm. 
16 Internal Ocular Breadth .. | 1510 2°91 mm. 3 mm. 72 
17 External Ocular Breadth 1185 | 4°87 mm. 3 mm. 43 
18 Breadth of Mouth 866 3°72 
19 Length of Ear 735 4°66 mm. 
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standard deviation (@) given in Col. (4) is based. Col. (5) gives the estimated 
difference in means due to differences in technique. I may note that these are on 
the whole conservative estimates. Col. (6) gives “Z” for samples of 100. It will be 
seen that except for the Cephalic Index differences in technique will certainly not 
be negligible in comparison with errors of sampling when the group consists of 100 
individuals or more. 


19. 
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THE GENERALISED PRODUCT MOMENT DISTRIBUTION 
IN SAMPLES FROM A NORMAL MULTIVARIATE POPU- 
LATION. 


By JOHN WISHART, M.A., B.Sc. Statistical Department, Rothamsted 
Experimental Station. 

1. Introduction. 

For some years prior to 1915, various writers struggled with the problems that 
arise when samples are taken from uni-variate and bi-variate populations, assumed 
in most cases for simplicity to be normal. Thus “Student,” in 1908*, by considering 
the first four moments, was led by K. Pearson’s methods to infer the distribution 
of standard deviations, in samples from a normal population. His results, for com- 
parison with others to be deduced later, will be stated in the form 





1 N-1 N-3 
dp = > i oe ST (1), 
r N-1)\ 
\ 9 } 
where N is the size of the sample, and 
N 
A= Fo?’ a= s*, 


o being the standard deviation of the sampled population, and s that estimated 
from the sample. Thus, if #,, 7, ... ay are the sample values, 


N 
Ni = > (x), 


N 
and Nst=>(«#-7Z/. 
1 


When bi-variate populations were considered, other problems arose, such as the 
distribution of the correlation coefficient and of the regression coefficient in 
samples. These problems, taken by themselves, were found to be difficult, and 
only approximative results had been reached, when, in 1915, R. A. Fishert+ gave a 
formula for the simultaneous distribution of the three quadratic statistical deriva- 
tives, namely the two variances (squared standard deviations) and the product 
moment coefficient. Thus, let 2,, a. 


», «+» @y represent the sample values of the 
x-variate, and y,, yz, ... y the corresponding values for the y-variate, let o, and o, 
be the standard deviations of the sampled population and p the correlation 


between # and y. We then calculate the following statistical derivatives from the 


sample : 


N N 
Nz = > (a) Ny=X(y) 
1 1 
N 4 
Ns? = («#—) Ns? = =X(y—yy 
1 1 


N 
Nrs,s.= = (a — 2) (y — 7). 


* Biometrika, Vol. v1. 1908, pp. 4—6. 
+ Biometrika, Vol. x. 1915, p. 510. 
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N N N, 
at. Be apie 
2o,°(1—p’)’ ~  207(1—p)’ " 20,0, (1 —p°)’ 


a= 8,3, b=s.2, h = 8,801, 


If we put A 








then Fisher's result, for the simultaneous distribution of a, b and h, may be put in 
the symmetrical form * 
1 |A H|N-1 — a h|N=4 
= (Nt > ei Mii 2 dadbdh 
izr(Z 5) (> HS | hb 
Nor ( I ) 

9 ) 2 





dp = 


The distribution of the correlation coefficient was deduced by direct integra- 
tion from this result. Further, K. Pearson and V. Romanovsky, starting from this 
fundamental formula, were able to deal with the regression coefficients. Pearson, 
in 1925 +, gave the mean value and standard deviation of the regression coefficient, 
while Romanovsky and Pearson}, in the following year, published the actual 
distribution. 

New problems arise when we are dealing with three or more variates. There 
are, for example, the distributions of the partial and multiple correlations, and of 
the partial regression coefficients. In this domain our knowledge is still far from 
complete. The partial correlation coefticient has been shown to be distributed 
exactly as a total coefficient, when the size of the sample has been reduced by the 
number of variates climinated§. The actual distribution of the multiple correla- 
tion coefficient, for the particular case where no real correlation exists in the sampled 
population, was shown in 1923)| to be of the form 
(N — t\ 


,_(N—-n x na Ty 
V3 ice } 


where n is the total number of variates. A recent experimental investigation by 





n-3 N-? 
(72) (2=26) 2 GG), ccsncane (3), 


the present writer? may be regarded as confirming this result, as well as was to be 
expected on the basis of the inadequate numbers used. On the other hand, 
L. Isserlis**, in 1917, found approximations to the mean value and standard devia- 
tion of the multiple correlation coefficient, in the case of three variates, while 
recently P. Hall++, for an n-fold system, determined (a) the mean value, and (b) the 
standard deviation, of the multiple and partial correlation coefficients, as far as 


~ 
terms of the order of Vv" 


* See Romanovsky, Comptes Rendus, Tome 180, 1925, p. 1897. 

Biometrika, Vol. xvi1. 1925, pp. 195—196. 

Bulletin de VAcadémie des Sciences de ? U.R.S.S., 1926, p. 646. Proc. Roy. Soc. A, 112, 1926, p. 1. 
R. A. Fisher, Metron, Vol. 11. Nos. 3—4, 1924, pp. 1—2. 

R. A. Fisher, Phil. Trans. B. Vol. 213, 1923, p. 91. 

{| Memoirs of the Royal Meteorological Society, Vol. 11. pp. 29—37, 1928. Rothamsted Memoirs, 


Oe ++ + 


Vol, xiv. 
** Phil. Mag. Vol. xxx1v. 1917, pp. 205—220. 
++ Biometrika, Vol. xix. 1927, pp. 100—109. 


Biometrika xxA 
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What is now asserted is that all such problems depend, in the first instance, 
on the determination of a fundamental frequency distribution, which will be a 
generalisation of equation (2). It will, in fact, be the simultaneous distribution in 
; aa? n(n—1) 
samples of the n variances (squared standard deviations) and the — — product 
moment coefficients. It is the purpose of the present paper to give this generalised 
distribution, and to calculate its moments up to the fourth order. The case of 
three variates will first be considered in detail, and thereafter a proof for the 
general n-fold system will be given. 
2. Tri-variate Product Moment Distribution. 
Let the frequency distribution of the population sampled be 
1 
dp = -—_—______ 
(2a)? o,0,0,VA 
— Pa 4 Waal agg Hal gg Oa ad E— gg  glEo meme) a, 4 gle mdi) 
xe 2A a og os" G203 G39) 91% 
dadydz Saakcnuien (4), 
where A is the determinant | p,,| s, #=1, 2, 3, and A,, is the minor of p,, in A. 
Now let x, 72, ... @y represent the sample values of the #-variate, and 
Tay Bar ose Bits Bae Say «+> Oy ' 
be the corresponding values for the y- and z-variates. Then the chance that 
2, Yis Zi, Va, Yo, 22, «+» YN, YN, ZN 
should fall within the elementary ranges da, dy,, dz, ... dwy, dyy, dzy is 
1 
dp = 3NV a > 
(277) 2 (o,0,0;)% A2 
2 2 z(' t mi)? An+! = ae (z- ms)? gy — me) (2 =m) ig Mg) (x - fo E gem) 9.) 7 
xe 2A l “’ 2 o;° F203 = O30) 0,9 
da,dy,dz,da,dy,dz, ...... duydynd2y ........000. (5). 
The following statistical derivatives are now to be calculated from the sample 
and substituted in (5): 
N N N i 
Na = 3% (#) Ny==(y) N z= 2, (2) 
1 l 4 
. N é N N 
Ns? => («#—#), Ns2= > (y-yy Ns? => (z- 2? 
1 1 1 
, N N N _ 
N 3828; = X(y-—Y)(z- 2) Nrys,s8,= = (2 —2)(x#—2) N TS) 8 = &(a@—Z)(y—- Y)e 
1 1 1 


In order to transform the element of volume, we require to extend somewhat 
the geometrical reasoning employed in the two variate case*. The N values of « 
may be regarded geometrically as specifying a point P in an N-dimensional space, 


See Biometrika, Vol. x. pp. 509—510. 








. “2A, | 


(4), 
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whose co-ordinates are #,—@, #—@, ... ay —-%. Similarly the WV values of y and the 
N values of z specify points Q, R in the same space. When & and s, are fixed, as 
when a particular sample is chosen, P is constrained to move so that its perpen- 
dicular distances from the line #,=«,=...=ay and from the “ plane” 
+t... t+ ety = ne 

remain constant. It must therefore lie on the surface of an N —1-dimensional 
sphere which is everywhere at right angles to the radius vector #,=a,=...=#y. 
The element of volume is then proportional to (VN s,)¥~? ds,dz. For the factor of 
proportionality we require the entire area of the surface of a sphere in N-1 


dimensions. If of radius 7 this is, for V =3, 4, 5, 6, 7, 


Qerr, 4ar*, Q?r®, Sartrt, a1’, 
the general result being 2. ry 9 yi~s. 


M3 


Thus we have a contribution to the transformed element of volume of 
Qs V2 ds, dE. 
\ 


By similar reasoning Q and R must lie on concentric spheres in the same 
space, and there will be corresponding contributions to the transformed element of 
volume. Let the radii vectores OP, OQ, OR be cut by the unit sphere whose 


Pp 








R 


centre is at O in the points Z, M, N. Then LMN is a spherical triangle, specified 
by the nature of the sample. To find the chance that this particular triangle 
should be chosen we note that, P being fixed, the chance that Q (or M) should fall 


vin A ° 
within the elementary range d@, (@, being the angle 5 — LOM) is equal to 


cos 1-86,d0, + [cos N-36,d0,, 
0 
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i.e. to 





) , 
Vor r( 9 ) 


Similarly the chance that R (or NV) should fall within dé, (Lon = + 6.) is equal to 





‘(N-1 
(t= ; 
- : -——y cos 7-8 0, dé. 
Vol (= 2 


But the points M and N do not vary independently. 'They are connected by the 
relation that, if ¢ be the angle between the planes LOM and LON, 

Pes — TiN is 
VIl-rneVvl—ne 
Now LM being fixed, the chance that ZN should fall between the angles ¢ and 
$+ dd, measured from LM, is equal to 





C08 6 = Paya = 





r (A 
sin -4*h dp + "sin \-*hdb= = — sin ¥~*¢ d@. 
/ 0 Vel (- 5 %s 





The transformed volume element will consist of the product of all the above 
probabilities. The exponential term in (5) is easily expressed in terms of 8), 82, 8; 
and the 7’s, and we have 
3 (N-2) 
et 


dp = > 7 


pe = ray ° N 
a r (5 ") r (7 ; “) r (- = ‘) (80,20.20,2A)2 


LN re" m (e- m)? 82 + (J — my)? a 852 + (Z — my)? es lg 8o83 + (Y — My) (2 — ms) _ 


a; a” al o;” i G93 





Tx) 8a8, + (Z — m,) (X — m,) I oS) S> + (X — my)(¥ — M,) 
9 "315351 3 1 »@%19%1%2 ia TAL. 2) 
+2 Ag, + 2 ———— —— Aj 
O30; O10 


x 8,V~%8,4-28.4—-2 cos ¥-3 6, cos 4-36, sin 1-4 de dydz ds,ds,ds,d0,d0.dp 


We can integrate out immediately for 7, 7 and Zz. The integration produces 
(V/ 22)? Nia,c,0; VA, and we get 


5 








S NB \, V-1 
> te = 2 N —1 a N-2 fh = (<1 ” 
wil ( ji ( ) r ( 


N [fs;? -- $a" Poo So Sea Te: 8a% Tre8s te 
-5 I Ay + 2, Aog + 3 Ags +2 222 FA, + 2 FIA, + 2-2) * Av | 
xe 7ALer co 01%, 


2 3 O20, G39 
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Following Romanovsky* in the method of expression, let us put 





=a $18, sin 0, =h, 
s2=b $8, sin 8, =4, 
$2 =C Sy83 (COS 0, cos 6. cos i) + sin 6, sin 6.) =f; 





(for T2= sin 6, 7,;=sin 8, and r.,=cos¢= 


For the Jacobian of this final transformation we have 





of 
-_ 
0(a, b, c, f, q, h) , of og 
A : = 88,5283 | = : 
0 (81, S2, 83, A, A, ) 7 Gs: Os 
of 
x 0 
0b 


= 8s,'s,"s,° cos? 0, cos? 4, sin od 
= 8 {be (ab —h?) (ac — g*) | abe }*, 


where | abe | denotes the determinant 


a h q 
h b J 
g ff « 


The element of volume then becomes on transformation 


N-3 N-4 N-4 
2 {(ab —h*) (ac — g’)} 2 a2 l|abe|2 





Pez — TieM 3 ) 
Vl —neVv1l—n2 





mS 
|abe|~2 





N- 
1 (abe) 2 - atte N 
aN-3 (be) 2 {(ab — h®)(ac — g*)} 2 


a very neat result. 


—-" (be)? {(ab -_ h?) (ac = g)} 


Finally we have for the simultaneous distribution of the three variances and 


the three product moment coefficients the symmetrical expression 








1 1 N N-1 
= 7 ; - eT 2 
»n{N-1 N —2 N-38 uae a} 
=T( 2 )r( “ae ae ) 
N { a b 5 ( j 9 Lh \ 
xe “2A lo? 2 "6 4 o a Ans "3 Th + : 70,5" 
| @ ’ si 
x|h b f| dadbdedfdgdh. 
g9 f ¢ | 








* Comptes Rendus, loc. cit. See also Metron, Vol. v. No. 4, 1925, p. 31. 
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We may simplify this expression by writing 





dp = 


N An > N Aw “i N As 

aeays ail ae i vial oa 

4 NM N Bs N A, 

y = —, G= = = a 
. Se.0, A’ fo:6, &- ad tc,.c, & 
when it becomes 
1 i482 6S 
"N=, / Not 8 | HB F| " _ 9~ da- Bb - Ce ~ 28 f - 2Gg- 2Hh 
aps) re) 
- 2 2 G FF C€C 





| é dadbdcdfdgdh 
| 
| 


ene ioe (8). 
It is to be noted that | abc| is equal to s,2s,°s,2 ql, P, G=1, 2, 3. 

This is the fundamental frequency distribution for the three variate case, and 
in a later section the calculation of its moment coefficients will be dealt with. 

3. Multi-variate Distribution. Use of Quadratic co-ordinates. 

A comparison of equation (8) with the corresponding results (1) and (2) for 
uni-variate and bi-variate sampling, respectively, indicates the form the general 
result may be expected to take. In fact, we have for the simultaneous distribution 
in random samples of the n variances (squared standard deviations) and the 








n(n —1) ue ; : 
. product moment coefficients the following expression : 
[Ayn Aj... Ain |Xa 
Ag Ag... Am | 
; | 
| : | 
dp ee | Am A n2 Ann 


x @- And — Ag2422-. Ann Gun — 24 12% 2 — 2473 4)3 — «-. — 2A-pnGn-in 
1 Gy Ag... Ay 9 : 
x | oo _ | Ma cine. sche aecncscemeseet (9), 
| | 
1Qni Ono --+ Ann | 
Where yy = SpSo1pq, ANd Ay, = NT Ang A being the determinant 
Pq p°q' pq P41 2G, o, A 5 


and A,, the minor of pp, in A. 











»Hh 
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Chis result can be proved by the aid of the following general geometrical con- 
siderations. We shall begin by defining the quadratic co-ordinates of a set of 
points, and thereafter develop the argument by the use of these co-ordinates. 


(a) If 2, tp, ... “pn ave the rectangular co-ordinates of a point (p) in space of 
n dimensions, then the configuration of a set of n points relative to the origin may 
be specified by the co-ordinates 
n 
Enq = Real (LpkXqk)- 


- ‘ , 

hese co-ordinates will be unchanged by any rotation of the whole system 
about the origin, the configuration of the set of (n+ 1) points including the origin 
being unchanged. 


(b) The determinant | &,, |, p, g=1, 2,3, ... n, on expanding the several terms, 
may be recognised as the square of the determinant | «»,|, p, g=1, 2, 3, ... m, and 
is therefore equal to the square of the volume enclosed by completing the parallel- 
faced figure having one corner at the origin and edges running to the points of 
the figure. This volume will be represented by v,. 

(c) The perpendicular distance of the point (n) from the plane space passing 

nae : Sa ae 
through the origin and the points (1) to (n—1) is : 
n—1 

(d) If the points (1) to (n—1) are fixed, and that of the point (n) is specified 

by the co-ordinates &n,, En, ... Ean, then the element of volume in the neighbour- 


hood of the point (n) may be represented by 


l 
5, dem, Clas «ss Clan 
=Un 
For the Jacobian 
vy “2 Vin 
0(Enm; Eno, oe9 San) My A 2 Lon 5) 
= => = Zv 
0 ( Unis Un2> ++ Lnn) re = ” 
221n 2aon Zann 


(e) If in aspace of N dimensions the points (1) to (m — 1) are fixed, and that of 
the point (n) is specified by the co-ordinates En,, Eno, -.- Enx within ranges 
GE, dens, --- CEan, 
then the point (n) is free to move on the surface of a sphere in N—n+1 dimen- 
sions, of which the centre is the foot of the perpendicular from (n) upon the space 
containing the points (1) to (n—1). The radius of this sphere is therefore = . 
n—l 


and the area of its surface is 


4(N~n+1) fo N-n 

WT? 7 ar’ 

2 . “. ) (see above, p. 35). 
r(” —nt+ 2 ae 


2 





mie 
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This surface is everywhere normal to the space passing through the origin and 
the points (1) to (x). But the element of volume in this space is 


oe = 


2Un 
Hence the entire volume that may be occupied by (n) is 


mil -n+1) v N-n-1 0 
————-." .... bs ses svenbavetece 10). 
r (7 —S73\ i" GE ms Ens dE un on 

oc 


(f) The quadratic co-ordinates of n points in space of WV dimensions may be 
also regarded as co-ordinates of a single point in space of n sets, each of V dimen- 
sions. If the projections of the point upon the first n —1 spaces are fixed, the 


volume element corresponding to variations of the co-ordinates &,,, En, ... Ean will 
be that found above. 


(g) We now require the volume element corresponding to variaticns of all the 


co-ordinates of a point so specified in Nn dimensions. The component spaces of V 


dimensions each are defined by rectangular co-ordinates, hence the entire volume 


may be found by multiplying the volume element defined by the variations of 


En, En, tee | a by that defined by the variations of Sans Fan xe ae Conse- 
quently the volume element for variations of ali the co-ordinates is found by 
multiplying together the volumes (equation (10)) got by putting ” successively 
equal to 1, 2, ... x. The component volumes v,_, appear in these expressions 
successively in the numerator and the denominator with equai indices, and there- 
fore disappear from the product, thus we have finally for the volume element 


aku (2N-n+1) 





. ' . » N-n-1Jt 1 1m 0 wee 1). 
ri), (Sat) op (Nath): eee ee See 
(3)! ( —)el ( 


») > 





- / 


(h) Now write N—1 tor N and let &,,,= Nappy, t 


correspond with our earlier 
work. The above expression then becomes 


|} Qy, Ayo ... a 


arte (2N-n—1) ° Nin (N-1) | A (gq +++ Ug 


| 

| 

V—] N-2 N-n\|: : ca 
r(- r Ns Ons 2-2) Ae 
ra)e | 





day, da, ... da 











vin» 
5) 9 Paes \ 9 
i 4 g : iy : Oni One = Ann 
and this multiplied by a density factor 
n 9 ts 2B 1) 
-—(N-1 ;e2y~ 
(27) 2 ot ) 
‘ \AN 
becomes 
1 | Gn Ae ee 
| 2 
| 
n(n—1) YT r 9 + a Gy Ugg «++ Aon day, days eee dann 
ee 78-1\, (8-2, </H-0\| "| 
(Var) 2 |] —)\I T(—3-)] 3 > Ut | 
\"2 2) 2/1 | 
1 Gni ne Ann | 
Saoweecicouen (12), 


whence our general result (9) follows. 
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4. Moment Coefficients of the Distribution. 

It is now possible to find the moments, up to any desired order, of the 
simultaneous distribution given by (9). A number of these moments will be iden- 
tical with those found previously for the cases of one and two variates, and 
published by different writers. To illustrate for the case of three variates, let 





Y iN 1 
F (abefgh) = ; 1 a AR @% 
ar(=s*)r(=-)r (4) H BF 
\ 2 2 Ce ee 
G F ¢ 
ah g — ag 
x @—Aa-Bb-Ce-2Ff-2Gg-2Hh) pb Ff) caseracesees (13). 
g9 f ce 


Then the generalised product taoment coefficient of this distribution is given by 


where the limits of integration for a, b and ¢ are 0 to 2, and for f, g and h 
e ” eos — ° + ‘ 
— Vbc to + Vbe, — Vac to +Vac and —Vab to + Vab respectively. Now following 


the method of Romanovsky*, we define the function 


 (aByApv) = [ : da d) [ : dc [{ [2 (abefyn) etatbBteytfrtgut he df dgdh 


roe 


0 








svaseueaeeee (14) 
as the generating function of the product moments, which are given by 
Maines ™ |" nabbleb SO «| es (15). 
nae Oa! 0B* Ory'ON"™ Op? Ov! enfuy=A=p=vnt 
Since the total frequency volume in (8) is unity, we have 
|\4 @ -@ [242 
fe~ Au- Bo ~Co—21/-2Gy-2Hh dy = const.| HH B F| — 
é £ ©€ 





where the integration is taken throughout the volume. A parallel result is 
obtained for the integration in (14) simply by replacing A by A —a, B by B—8, 
H by H —}yp, etc., in the determinant | ABC|. What is then left to deal with is 
the fundamental frequency volume and we have 


fe G\|¥-1 | A- H-i4v G—t4y|_-! 
 (aByrmv) = | = 2 | . eee Ps 

| H BF x|H-}v B-B F-4r ...(16), 

iG F C¢| |G—-}y F-}3\ O-y | 


a convenient expression from which to calculate the moments. It is of the same 
form when generalised for the case of n variates. 


‘i ae 
We shall use the notation m k mi}, 
| ah g 
corresponding to the determinant |h Db fF}, 
| 
lg f ec 
7 


* Metron, loc. cit. pp. 27, 32, 35—6. 
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to represent the general product moment coefficient of the jth power of s,°, 
kth power of s.*, th power of s,*, mth power of s,s;7.;, pth power of s,s;7,, and 
qth power of s,s,7,.. These moment coefficients are in all cases except the first 
calculated about the mean of the sample, and are expressed in terms of the follow- 
ing moment coefficients of the general population : 


200 = o;’, Ho20 = o.", Koo = 03", 
Puo=9192P2, Min =%FsPis, Mou = %253P2;- 


In most cases there will be a number of similar results deduced from one 
another merely by a cyclical interchange of the suffixes, but one representative result 
for each class only will be stated. The following pages list every independent result 
up to the fourth order, and as far as eight variates. They therefore embody certain 
results which are already known. “ Student’ * first gave the one-variate moments up 
to the fourth order, while Soper+, to his degree of approximation, gave all the two- 
variate results, to the same order. His second and third order moments agree with 
the following, except for the terms in NV, which he makes - and >, in place of 


N-1 N-1 


v2 and © ye * The divergence of the fourth order results is wider, for the same 
WN? N? 


N 


reason. Isserlist later gave the general 2nd order—4 variate, 3rd order—6 variate 
and 4th order—8 variate moments, sampling from a limited population. For an 
infinite normal population his results can be expressed in terms of the p’s, and, of 
course, all the moments up to the fourth order can be deduced from his three results 
by associating the variates in all the possible ways. But his results were calculated 
about the general population value, and not, as in the following, about the mean 
of the sample. 


5. Derived Coefficients. 

We are now able to deduce certain other constants of the distribution curves 
specified by the moments cited. In particular we have for the Betas of a), i.e. of 
8, S21 12; 


; ( /°3)\)? at & 2\\° 4 Pas” (38+ Piz)? 
ay2P1 = + os F\ o , es % (1 + py?) 


( ;°4)\) ee 3) ae 6 L + 6py.? + prs! 
422 = “te jal + =3 a 
These values show that when py, the correlation coefficient of the sampled popu- 
lation, is zero, the distribution of a,, is symmetrical, but not normal. For 
6 
N-1’ 


showing that as far as the first four moments the curve agrees with Type VII. On 


B,=3 + 


* Biometrika, Vol. v1. 1908, p. 4. 
+ Biometrika, Vol. 1x. 1913, pp. 103—104, 
t Biometrika, Vol. x11. 1918, p. 183 


JoHN WISHART 43 


the other hand, when p,,=1 we have the Betas of s*, the square of the standard 
deviation of the sample, 
8 12 
28, =—— 28o.=3+ — . 
& B, pi ms l ? ‘ Be N om 1 


the well-known Type III result *. 


Since ,,,82 is always greater than 3, the curve of distribution is leptokurtic. 
We also have 
12 (l- py 


28, —38, -6 =~... 
B: 3By ) N= (1 + py’ 











which is always positive, having a maximum value V - Also 
p? (3 + p*?? ( 1 1+6p?+p*)? ( 1 2 + 5p? + 8p* + p® 
C= 1+ = bh eh oe . = 
4(1—pyr|  N-1 (l+pP )  [ N-1 (1 +p) 


showing that the curve is of Type IV for small p, but beyond a value of about *45 
(according to the size of the sample) it becomes Type VI. 


Certain correlation coefficients also follow at once from the moments given, It 
will be remembered that Pearson and Filon in Phil. Trans. Vol. 191 A, 1898, p. 242, 
gave Tec,=pse and 1,7, = P/V2 approximately. They also gave approximate 
expressions for 7g,.1,,5 Trs.r,, 20d 7,,,.,,, (pp. 256, 259, 262). In our case the corre- 
lations we are able to deduce are those between the product moments themselves 
and are exact, thus 


Yay, - 22 = {u (' i)} > fw (*:)} tH (-3)}* = Pi, 
F ot cael Pee | _f 2p:2° )2 
ara ={H (1 *)} = {HPP HCE ~ 11+ pw 


Pasian ® . (’ i) ” a (* )' (, 


; - (-4 om (--2 | ( 
renew Ye) eC) eC) (1 + px*)(1 + pis}! 
-1.-\) p22. .\)2 ( se es\)2_ __ Prs Post PrsPos 
i) = fa oe a ( 3) 


J _ ( i. We. “The 
aaa dag 7B \ )S (1 + put) (1 + pst}? 


My thanks are due to Dr R. A. Fisher, in whose laboratory this paper was 
written, and without whose critical help it would have been difficult to generalise 
the geometrical methods employed by him. 


* “Student,” Biometrika, Vol. v1. 1908, p. 4. 
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IsT ORDER: 
1 variate 2 variates 
N-1, N- 
(1) p()= - (2) B(*')= 7 She 
2ND ORDER: 
1 variate 
N-1 
(3) (2) =2 V2 o;'. 
2 variates 
haat ee ae lah FT 2 
(4) p ( i) =2 N2 01'S, Pi2"> (5) »( 7 - WV o,°o2" (1 + pis”), 
Sut, 
(6) » (* ') =2 N2 1°02 Po. 
3 variates 
z bes N=t.. fom N-1 i 
(7) » ( i) 2 v2 71 %% Piz Pis> (8) p ( ') 75 Se (Piz + prs P2s)- 
4 variates 
: > N on 
(9) Bb ae - N2 0102.03.04 (P43 Pos + P14 Pos) 
3RD ORDER: F 
1 variate 
= N-1 ., 
(10) »(3)=8 Na o,°. 
2 variates 
2. . N— as is fal 9 + 9 N - * \ 2 
(11) w( i)=8 Na 71 %2 Piz’s (12) he ( ) 7h 0,°o2°py2 (3 + prs”), 
; 9 oN-1 . eee eo eae : 
(13) p(* ej =2 V3 0;'02°(1 + 3p"), (14) pw (* 1) =4 N3 0°02" py2 (1 + pis”), . 
2 i 
(15) Be r . ) 8 - 3 01°02 Pyo- f 
3 variates 
‘ Bu: , = 
(16) p ( : i) 8 N3_ 0140203 P12 Pris; 
ss tJ |. Ft ee ee : ; 
(17) »( ) 2 N3 0 °6203” (py2” + Piz” + 212 P13 Pras), 
' - N a F ’ 
(18) p( ° ) 2 v3 9102703" (py3 + 2pi2 Pos + P13 P2s”)s 
Jo eset ee 
(19) p \ : i) 5 N3 01°02" 93" P12 P1s Pes » 
ee ee oe 
(20) » ( . :) 2 N3 01'0203 (Pos + 3p 42 P13); 
: 3 Aek as . . ‘ ’ 
(21) » ( 1) V3 01702703" (1 + pra” + prs” + Pas” + 4P12 P13 P23); 
9° L. .\ a~h .., 
(22) p ( i i) =4 v3 01°0'2°03 Pye (Ps + Piz Pos): 
‘3 N-1., 3 2 
(23) pu ( ° i) =2 N3 01°02637 (Py2 + Pis Pos + 2 pis Pis”)- ’ 
\ 
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4 variates 


= 
) a( 3 i)o4 — 07030304 Piz (Piz Poa + Pra P23); 
as oN 
“ N. 
5) a ( ‘i)=2 = N3 07020304 [Pre Pia + Pis (P12 Psa + Pis Pea + Pia P2s) | 
111 oN rs 
(26) » ¢ :)=2 = N3 ae °620304 (Pir Pss + Pis Poa + Pra Pas + Piz Pis Pra), 
11 —*¥ 
B oe cn N3 01°F 203% (Piz Poa + Pra Pos + 2Pr2 Ps Pra); 
= ee ; 
B v1 Ns 170270304 [Pys Pra + Pos Poa + Pr2 (1s P24 + PrsP2s)], 
2%, L ie 
(29) oe ~ 01°06 2°0304 [Psa + Pis Pia + Pos Pea + Prz (Piz Psa + P1sP2a + 3Ps4 P23) ]- 


5 variates 


ute a 2 N3 01°02030405 [P12 (Pia Pss + P15 Psa) + Prs (Pia P25 + PrsPea)], 


wa. 2 ¢ 
01 02°030405 [Pra P35 + P15 Psa + 2pr2 Poa Pos + Piz (Pes P35 + P2s Psa) 
+ Pos (Pra P2s + PisPzs)]- 


6 variates 


32) “ee ee N - 1 
= N3 10203040596 [P13 (P25 Pac + P26 Pas) + Pra (P25 Pse + P26 Pas) 


+ Ps (P23 Pas + Pos Pas) + Pre (P23 Pas + Pea Pss)]- 
4TH ORDER: 


1 variate 
(33) p (4) = 12~ — —— o°. 


2 variates 


(34) w (2) =12 ros" 
(35) w(23) = 4" ayo [NV — 1 + Spy? + 2(N + 1) pa) 
(36) Oa) pe ee » x50," p42 (1 + 2p), 

(37) w (18) = 6 A= DN) 5 s6,2p10(1 + pu’ 


(38) (#1) = 12 NF 9) 620, on, 
4 (N -1)(N +3) 


Ne 


; (39) uw (**) = 2+— —" o,°o," (1 + Spy"), 
4 12) gN 1, ago + (GN + 11) pe? + (N +5) ont 
(40) ph ( i) oa oy'on*[2 + (GN + 11) pys® + (N + 5) prs'], 
N-1 fe oe 
| (41) w(-*) =3- eyo, [(N +1) (i + pro!) + 2(N +5) pu’. 
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3 variates 


N-—1)(N+3 


vw 
i 
Na ~ 04°02.03 Py2 P13, 


12 ( 
See an i i 
“f Na 04027057 [(N — 1) py? 4 SP12 Ps Pos + 2(N + 1) py2?pys”), 


N-1 . : ; 
‘ N4 o;'02°o; [(N — 1) pos 4 Epis (Pis + Piz P23) + 2(N + 1) pyo*prs], 


N-1 ee ate " ‘ 2 2 
2 va o,40,°03" ((N — 1) (1 + P23") + 4 (py2® + pis”) + 8pi2 Pr Pos 


+4 (N * 1) pi2"pis”), 
N —1)(N +3) « 
: a ( - A 9 0°20; (po; + Spy pis), 


o(N-1)(N +3) , _ ; e 
2 ( 2 010203" (Py2 + P13 P23 + 4py2 P13”) , 


eee? See” 


“ 


= 4 Na 0,°02°o 3° [2 p12 prs + (N + L) pos (piz® + pis”) + (N + 5) Piz Ps Pes” |, 


= is ery 3 2\, 2 2 
2 - yi 7 "anos" [(N + 3) pyo® + 2pys* + 4(N + 2) prs prs poo 
+ (N +5) Piz" P2s"|, 


—_~ 
cS 
oa 
a= 
¥ 
tc te : w 
eee eee ee Se” —Z ee” 


11 N-1 , eee 5 eat 

1 ) = 2 va G;'02°03[2p., + 2(N 4 2) Pr2Pis + (BN + 7) prs® pos + (N + 5) pr2* pis], 
\ 
) 


=2 Na 0,°0,%o" [2 (py. + N pis p23) + fpi2 (13° + pos®) + (N + 1) py? 
+ (3N + 7) P12" Ps Pes) 
= 6 va 0,°0,%o;°[(N + L) Piz pis (1 + P23”) + 2003 (pis? 4 P13") |, 


N+3) , ; - . z 
tone Va 1°02.03" (Pra + 2pi3 Pos + 3pPy2 Pr”); 


N- Bo ae tea : : : E 
=2 Ni 0,°0°03°[(N + 1) Pis (1 + prs”) + 4py2 p23 + 2pi3° + 2(N + 2) Piz Pis 


2 t * W- 
‘ee Se “ee 
4) 

— 
a 
| 
—_ 
=a 
a 


+ 2(N t 3) Piz Pis"Posl, 


- 111 N-1 mee : , ‘ ee 
(55) »( ') "? we” oy°o3" [1 + pos? + (N + 2) (prs + prs’) + (3N + 7) Piz Ps Pos 
T (N = 5) Piz" Pis*|, 
oN - ' : Fn 
( sj=3 Ne 0, 0,°o;"[(N + 1) (Piz + P13 P2s°) + (N + 5) Pas (pis + P12 P2s)]; 


‘\ 
) 
oll N-1.,., eee ee a ae 
2) ~ Wa o,°o3°03*[(N 4-1) (1+ “P12” + SPs" P23") 
+(N +5) (P1s" + prs” + 4 p12 P13 Pos) |, 


mr 0,°0,°o° [(N + 5) P23 + (3N + 7) PizPis + (N + 1) P23" 
+ 2(N + 3) Pos (Pi2® + ys”) + (3N + 11) p12 prs prs"). 
4 variates 
i a N-1,, i oe : 
(09) an ayy Na 72 09°03 04[(N — 1) pas pos + £12 (P13 Pes + Pra Pos) 


+ 2 (N ¥ 1) pis” Ps Pra] 


RS as y T ‘ 
, ‘i o(N-1)(N +3) , ; 
(60) p» ( 5 i) =2 A - 0y°020304 (P13 P24 + Pra Pos + 4 P12 P13 Pra), 
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- Bits N _ 1 
aa ¢ 2 T 
(61) p ( ; ) =2 Nt 040270303 [(N — 1) (psa + pos pos) + 4pi2 (Pi2 P31 + Pis P21 + Pra Pos) 
+ 4py3 py {1 + (N + 1) pr2*}], 
a. T 
x a a ee ‘ 
(62) p ( : 1 :) 4 Ni 0,°0;°03°04" [(N — 1) (pi2"paa? 4 Pis’P2a” + Pra”P2s”) 
+ 4 (pis P13 Pos Psa + Piz P14 P23 Psa + P13 P14 P23 Poa) |, 
b9 T 
se ee eee 
(63) ( = ) f Nt 01°02°03°04 [(P12 P21 + Pis Poa) {2p22 + (N + 1) pis prs} 
T Pia P23 {(N — 1) ps + 4 p12 pis} |, 
: ae 7 
(64) p ( ; ) =2 Ni 0,°o3%o304[2(N + 1) pre (P13 Pia + Pos Poa) 
< (Pis Pos + Pia P23) {2 ¥ (N + 5) Piz], 
i. T 
= nn x - 
; (65) ( , : ? 2 Na - 011020304 [2 pes Pos + 2(N + 2) Piz (P13 Pos + P14 P23) 
j + pis” {(V + 3) Psa + (N +5) Ps Piss], 
ose N-—1 
(66) n( i) . Wa 02 02°03°04 [2s (Prz Pra + Pra Pos) + 2pr2 Poa (1 + 2pys") + 2N pis pos pos 
+ (N 4 1) pra* (Pia + Pis Psa) + 2(N + 3) Piz P13 Pris Pos! 
A. 7 
. Re—f.- . : : 
(07) pw ( , ) 2 Ni G,°03°o30,4 [2 {P12 Psa r Pig Pog > (A = 1) pis Post t £12 P13 Pra 
+ (N +4 1) py” (Piz Psa T 3 p13 Pea) t £2 Pos (Pos + Py P14)], 
a. ry 
(68) p ( 3) =2 va 0,°0,°0;°0," [(N — 1) pio” (1 + pa) + 2 (pis*poa® + Pra® Pos”) 
+ 40,2 {pis Pes + Pia Poa + Psa (P13 P24 + Pra P2s)} + LN pis Pia Pos Pos], 
12 i a y 
Wate. 2 ee 
(69) p ( ats ) =2 Ni 0140270304 [(N + 3) pis pu + 2 pos Po, + 2(N + 2) pre (Piz Poa + Pra Pos) 
> + (N +5) pis*pis pul, 
i. 31 N 1 : 
f (70) p ( -, ‘) =2 Na G1°02703°o4 [(N — 1) pig (1 + pros) +2 (Pr2 Poa + Pas Psa) + 2pPra (Pro? + Ps") 
+ 2 (pio Psa + PisP2) {P28 + (N + 1) prs pis} 1P12 P13 Pis Pos], 
ae 
» 5 i eee — : 2 
(71) be ( : si) = 2 va 01 °03"0'3°O7" [2po. (Pi2” + Prs") + 4 (pis Pia Pos + Piz" Pos Psa) 
3 + (NV + 1) pis {Pus (1 + P23") 4 2 prs (P34 + P23 Poa)} ] 5 
1111 r ae 
‘ a N-1)(N+3) , , 2 
(72) p ( a ) - | Ni 01° 2.0304 (Pi2 Psa + P13 Poa + P14 P23 + 3Pi2 P13 Pra); 
wag ; 
(73) x ( , de =32 NA 0110270304 [Psa+ Pos Por + (N + 2) {prs (pra + Piz Pea) + P12” Psa} 
+ (N +5) pis"pis Pia + (2N + 5) prs pra pros], 
| Ae N =e : 1 
7 (74) pw ( : ‘) =2 va 0,°09°0304[(N + 1) prs (Psa + P23 Poa) + 2 (N + 2) P12 P13 Pia 
+ 2 (P13 Poa + Pra P2s + Prz®Psa) + (N + 3) prs® (P13 Poa + Ps Pos), 
|S re N ag 
(75) p ( " ) a NA 61°09 2703704 [Py3 (Psa + 323 P21) + Pra {l + 2pi2* + pos® + (N + 1) prs? 
7? , 19 


+2 (N + 2) P12 P13 P23} i N pis (Pos - P2s Psa) + Piz Pis {2p12 pss 5 (N + 3) P13 Psa} | 
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' 
ie ge kd Pe ne ae ee ee ae 
(76) wl -. i) =2 Na 0769703704" [py2” + Ps” + Pra® + P12" Psa” + Ps Pea” + Pra” Pos” 
+ Pr prs {(N + 1) pas + (N + 3) Pos Psat + Pr2Pra {(N + 1) pos + (N + 3) pos psa 
+ P33 Pia {(N ¥ 1) Psa 1 (N bs 3) P23 Pos} |; 
pe sisi i : . . 
(77) w( i) =3 No 01° 2°0304 [4p 12 (P1s Pia + Pos Pea) +(N + 1) (1 + pis®) (pis Poa + Pra Pos) ], 
-211 7% : : 
(78) w( a Na 014070305 [(N + 1) (Psa + 2p22 pos + 3pi2" pis Pra) 
+ (N + 5) (pis Pia + P12"Psa + 2 p12 Piz Pes + 2pi2P2 Pus) ], 
f ak N re. l d : $ ‘ 
(79) ( a - Wa o,°0%0304[(N + 5) prspsa + 4pis Pos 
+ (N + 1) (3pis pes + 2pr2" P13 Poa + Pr2* Psa) 
+ 2(N +3) Piz (Pis Pia + Pes Poa) + (N +9) Piz" P14 Pros], 
| N ie 1 se 
(80) n( fy = “Wa 01°02°0'3°04 [4p y2 (Poy + P23 Pss) + 4Pis"P14 
T (N + 1) ((1 T P12") (Pra P13 Psa) r 2 (P14 P23” r P12 P13"P2a)} } 
+ 2 (A + 3) P13 P23 Pea = 2 (N + 5) P12 P13 P14 Pos > 
(81) w( i) a i G47037o3"047 [(N — 1) (1 + pry?) (1 + pg?) + 2 (pis? + pra + pas? + pos?) 
r 2N (p1sPoa” T P1a"Pes”) i L { P12 P13 Ps T P12 P14 P2a 7 Pis P14 Psa 
P23 Poa Psa + Piz Psa (Pis Pos + Prs Pos) + (N + 2) prs Pra Pos Pad], 
t 
Ed a ee P ; . > 
(82) n( = va 0,°03703°04 [ pyy {2 (1 + Pog”) 4 (A 1) (pys” t Pis")} 
+ 2(N + 2) pos (Pir Psa + Prs Poa) + (N + 3) (Piz pea + prs Psa + 2pre prs Ps Poo) 
+ (N + 5) prs pis (Piz Psa + Ps Pea) |, 
Bt:. 7 
83 oe sake) 307 + py3 + Pog? + Pag? + N (pig? + frog?) 
(83) p 1 _ eens Pre Piz’ Pos Psa 4V (Pia™ 1 Pos") 
+ Pr2"Psa” + P1s"Poa” 4 (N 2) P1a7pos” 


+ (N + 3) (p12 P13 P23 + Piz Pia Pos + P13 P14 Psa + Pos Poa Psa) 
+ (N + 7) pra pos (Pr2Psa + pis Px) + (3N 1) P12 P1s Pos Psa) 


5 variates 


. ga Op eN-1, : ? 
(84) p vo oa 0,'0,03;0,05[(N 1) (P24 Pss + Pos Psa) 4 by (Pra Pas + Pis Psa) 
+ 4 pis (Pia Pos + PrsPxa) + 4(N 4 L) Piz Pis Pis Fis]; 


4 va 01702703040; [(N 1) (P12"P 34 Pss - P13" P24 P25 1 P23" P14 P1s) 


+ 2pis Pos (Pis Pos + P15 Poa)], 


B~3... . r P 
aa 6 1°09°030405 [2 P19 (P14 Pas i P15 Psa) T 2 (pis y 2 P12 Pos) (Pia Pos F P15 Pea) 


+ 2py2 Prs (Pos Pas + Pos Psa) + 2Pr2 P2s (Pra Pss + Ps Psa) 


+ 2(N — 1) prs pis pos + (N + 1) pis {Pr2 (Poa pss + P25 Psa) + 243 Pos Post], 





sat 
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eee 
Bien Cn eee : , . . 
(87) p . , ) =2 ye SSS [(N — 1) pis? (pas + psa Pas) + 2 (p1s* Pos Pos + Pon” Prs Pris) 


+ 2pr2 Prs (Pos Pss + P25 Psa) + 2pr2 Pes (2 2Pis Pas + Pia Pas + P15 Psa) 
+ 2 (pre + N pis prs) (Pra 2s + patel). 


a 
ae SS ae ; : " a 
(88) e( 3) 2 Na 21 02°93"0495[(N = 1) pis pis (1 + Pog”) + 2 ( P12" Psa Pas + P13" P24 Pos) 


+ 2N prs Pris (Pos Pas + Pos Psa) + 2Pr2 Pra (Pes + Pes Pas) + 2pi2Pis (Pes + Pes Psa) 
r 2 pis Pia (Pss r P23 Pos) r 213 Pis (Psa r Pox Poa) |, 


bo 


t (N + 2) {Piz (Pra Pas t P15 Psa) 5 Ps (P14 Pos r P15 P2a)} 
+ (N + 3) pis pis (pes + pis Pis)]; 


N 


b9 


-] ” 
Ne 01°0203°0 495 [Py Pos + Ps Poa + 2 P12 Pia Pas + (N — 1) prs (Pas + Psa Pss) 


REE... 
ager Rak. sath fy - 
ee Na — 0402030403 [Pos Pas + Pos Psa + “P12 P13 Pia Ps 


2. Ps (p: 23 Pas + Poa Pas + Pos Psa) + (pos v 2 p12 Ps) (Pis Pas + Py4 P35 + P15 Psa) 
F (N + 1){ {P13 * (Pra Pos y Pts Pra) ar 2 P13 P1s Ps Pos} | 


Pe Fa 
a aes a = Mg " . 2 
(91) “( .. ) : va 0°0203°0 405 [(1 + 2p45") (14 Pes + Ps Pra) 


bo 


+ 2proPs4 Pas + Pes (Pia Pas + Piss) + N pis (Pos Pas + Pas Psa) 
+ (N + 3) pis pis (Pra Pas + prs Psa) + (N + 1) prs pis (p12 + prs P2s)] 
N-1 ., ‘ rear y 2 
. NA 09020370405 [(N — 1) prz (Pas + Psa Pas) + (N + 1) {p1s? (Pra Pos + Ps Poa) 


+ 2p43 Pra Ps Pest - Pia P25 1 P15 Poa 7 2 Pr2 Pia Ps T 2 P12 Ps (Pra Pss r P15 Psa) 
t 2 p13 Pas (Pog r P12 Pis) T Psa (P13 Pos ° P15 P2s) T P35 (Piz Pos Pia Pos) |; 


Ride 
, -1.. Sent Oe ; ‘ , 
(93) e( iy :) : Va 21 F293 F495” [P12 Pra (1 + ps*) + rz Pis Pas + Poa (Pis” + pis’) 


v Ps P23 (Pra r P15 Pas) F P13 Pos (P13 Pas t 3 P14 Pos) T Ps Psa (Ps Pes 1 3 P12 Pas) 
+ N {P12 P1s (Psa + Pas Pas) + Pra Ps (P25 + Pes Pss)} + (N + 1) pis Prs Pos Pss 
+ (N+ 2) Pis Ps P25 Psa] 


21. 
Bohs i rn 
(94) fag 5 Na 01°02°03 040; 5 [4pi2 (p Poa P35 Pos Psa) 3 4 p13 Pos Pos ig 1 p14 Pis (043 . 2 P12 P23) 
t (N + 1) {( 1+ Piz") (Pia P35 + P15 Psa) + 2 (pos r P12 Piz) (Prs Pos tr P1sPea)}], 
(95) p -11 - “ya 01°0'3°03°0405[(N — 1) (1 + prz®) ( Pas + Psa Pss) + 2N (pis* Pos Pos + Pes” Pra Prs) 


+ 2(N + 2) prs pas (pis Pos + rs Pes) + 2 (Pia Pis + Per Pos) + 2pas (Pis® + Pos”) 
2 prs (Pia Pos + - Ps P2 ») + prs (Pus Pas 4 + P15 Psa) + 2p. 23 (Pos Pas t P25 Psa) 
5 rn + Pos P35 + Pos Ps sa) + 2Pis Pe: 23 (P13 Pas + + Pia Pss + P15 Psa) | 


1111 
™ “( )s Vn * oi ‘02030405 [(N Pos Pas + Pos Pos + Pas Psa + 3Pi2 P13 Pia Ps) 


(N + 5) {Piz (pis Pas + Pra Pas) + Pra (P13 Pos + Prs P23) + Prs (P12 Psa + PrsPoa)}] 
4 


= 


Biometrika xxA 
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| 
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111 
(97) e( * =" = wi Gy°02.03°0 405 [2py2 (Pas + Psa Pss) + 2pr4 (P25 + Pas Pas) + 41s” Pis Poa 


+ .4 +4) {P1s” (P12 Pas + Pia P25) + 2pisP Pes Psa + (pis + P13 Pss) (Pos + Piz P1s)} 
+ 3) pis (Pxs Pas + Pos Psa) + (N + 5) p13 pis (P12 Psa + Pr P2s)], 


? 
(98) “CE 2 W a 0170270370405 [2 {Pra prs (1 + pos”) + Pos Pos (1 + pis®) + Psa Pss (1 + prs*)} 


(Pra Pos + prs Pos) {(N + 1) pis + (N + 3) PraPea} 
i teua + P15 Psa) {(N + 1) Pis (N + 3) P12 P2 ay 
= (Pes Pas y P25 Psa) {(V +4) P23 + (N + - 3) P12 Pis 3], 


tt... 
(99) “( “+ ') "Wa 0170270370405 [pas (1 + prz® + prs” + N pos") + P14 P15 {A +(N + 2) Pos"} 


+ Pag P25 (1 + prs*) + Psa pss (1 + piz®) + pes Poa Pss + 213 Pra Pos Pos 

+ Spi2 Pia (Pos + P23 Pas) + 3 pis Pis (Psa + P2s Pra) + 312 Pes (P13 Pas + Prs Psa) 
+ N {P12 Ps (P2s Pas + Poa P35 + Pos Psa) + P12 P15 P2a P13 Pia Ps} 

r (N + 1) P12 Psa (P13 Pes + P15 P23) 4 (N + 2) Pos P2s (Psa + Pis Pa) |; 


wee s« 
fa. N=}... ‘ “ 2 
(100) ( ey ) = “we 0;°0,"030,70; [(1 + p42") (Pss 1 P34 Pas) + (pig + 3 P12 P23) (pis + P14 Pas) 


+ Pog” (pss + P13 P15) 3Poa (P25 Ps sa + Pi Ps P25) 

+ N {pups + (pos + Piz P13) (Pos + pe Pas) + (2N + 1) pre prs Pea Psa } 
+ 2) prs (P15 Psa + Pris P2s P25) + (N + 3) Piz Pia Poa Pas 

+ (N + 4) pis (P12 P25 Psa + P1s P23 Pos) ]- 


6 variates 


oe Ge 
Se N = ; 
(101) p ' , ae = 2 Va 0170270040506 [(N — 1) {Py2" (Pss Pas + Pe Pas) 
' + 20 3 P14 P25 Pre + 2p15 Ps P2s Pos} + 2 (prs Pos + Pra Prs) (prs Pes - + Ps Pos) 
2x. » 13 (P25 Pas + P26 Pas) + 2 P12 Pri (P25 Pas + Pg P35) R 
Re + 2py2 Ps (P23 Pag + Pes Pas) + 2P12 Pre (P23 Pas + Poa Pss) I, 
Pee Mette N-1 , . 
(102) et 2 Na G1°02.03049506[(N — L) Piz (P35 Pas + Ps6 Pas) P1s (P25 Pas + P26 Pas) 4 
<" . 
2 t Pra (Pos Pas i P26 Pas) I P1s (P2s Pas + P24 Ps6) + P1s (Pos Pas + P24 P35) 
t 212 Ps (Pris Pas t P16 Pas) r 2 P12 Pra (Pis Pas + P16 P35) 
' r (N a 1) {P13 P1s (P14 Pos 1 P16 Poa) r P14 Pic (P13 Pas r P15 P2s)} 1, 


: 1... ,N aa bi - 
(103) py : . oH Ni 04°02.03°04059% | Pr (P15 Pas + Pre Pas) + Pra (P15 P26 + P16 P25) 


: 2 


1 2 Pr» P1s P35 P36 y 2 P13 Pis Poa Pse + 2 P13 P16 Poa Ps + p33" (Pos Pas + P26 Pas) 
+ N {py prs (P35 Pas + P36 Pas) + Prs Pra (Pes Pas + ~ P26 Pas)} + Ps Ps (P2s Pas + P26 Psa) 
+ Ps Pre (P23 Pas + P25 Psa) + (N — 1) pis Pic (Pos + Pex Psa) 
+ (p12 Psa + P14 P23) (P1s Pss + P16 Pas) |, 
oh. eS 
(104) p "tii | =2 Na 01°6'2030 "050% [ P12 (P15 Ps6 + Pr6 P35) + Prs (P15 P26 + Pre Pos) 
+ (N - 1) Piz Pis (Pse + Pas Pas) + N pus {Ps (Po Psa + Pr Poe) 
+ Pre (P24 Pas + P25 Psa)} + Piz Pra (P35 Pac + Pas Pas) + Prs Pra (Pos Pas + Pos Pas) 


t 
5 
+ pia” (P25 Ps + Pos Pas) + 215 Pig P24 Psa 
+ (P12 Psa + P13 P24) (P15 Pas + Pre Pas + 2prs Pse)]; 
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(105) p : : : se : Nt awe [((N — 1) (1 + py? *) (Pss Pas + Pas Pas) 
; 2N (p13 Pos + pis Pos) (P15 P26 + P16 P25) + 4 (pis Pra P25 Pos + Ps P16 P2s Pa) 
: 2 (pis + - P12 P23) (P15 Pas + Pre Pas) + 2 (pra + Pi2 Ps) (P15 P36 + P16 Pss) 
2 (pos + - P12 P13) (P25 Pas + P26 Pas) + 2 (pox + Piz Pra) (P 2s Pse + Pre Pss)], 
= 7 t ae 
’ (106) p» ces sok ON (P35 Pas + Ps Pas) + 2 p13 (Pos Pag + Pog Pas 
Ni 72 92% % 5% “P12 \P35 Pac * Pe Pas 2 P13 (P25 Pas + Pos Pas) 
; si 2pr4 (Pos P36 + P26 P35) r 4 15 Pis (P12 Psa + Pig Poy + P14 Pos) 
i (N + 1) {(P2s r P12 Pis) (P15 Pas F P16 Pas) 7 (Pea + P12 Pia) (P1s Pss 7 P16 Pss) 
+ (Psa + P13 P14) (P15 P26 + Pre Pos)}], 
aes 
. nt were N ie 
(107) p ae — Wa 01°06 3°0304050¢[(1 + pis") (P35 Pas + Pe Pas) 
: + (pis + Spy ps: 3) (Pis Pas + Pie Pas) + (Pos + Spiz pra) (P25 Pas + P26 Ps) 
+ 2psa (pis Pie + ~ P25 Ps) + 2 (pis Pis Pos Pos + P15 P16 Pes Poa) 
+ (P15 P25 + P16 P25) {P13 Pea 4 (N + 1) pi2 Psa + (N + 2) pus Pos} 
+ N {(Pes + P12 P13) (P25 Pas + P26 Pas) + (Pia + Piz Pos) (P15 Ps6 + P16 Pss)}] 
i ee ae 
= N- 1 
(108) pw 77. ° Ni 01 02.02°04"0;05 |(N —1) (Piz + P13 Pes) (Pss + Pas Pas) 
+ N {(Pia Pss + P15 P31) (P21 Ps + Pos Psa) + (P14 Ps + P16 Psa) (Pes Pss + Ps Psa)} 
r 2Pi2 Psa P26 Pas + P1s P24 (P35 Pas + P26 Pas) + P13 Pos (Pss + Psa Pas) 
+ 2 P14 Psa (P25 P36 + P26 P35) + 2 pos Psa (P15 P36 + P16 P35) + 2Pss Pose (Piz Psa + P13 Poa) 
+ (Pis + Pia Pas) (P26 + Pes Ps) + (Pie + Pra Pas) (P25 + P23 Ps) 
+ (P24 + Pas Psa) (2P1a Ps + Ps Pas + Ps P45) + (P36 + Psa Pas) (212 P35 + Pris P2s) J. 
7 variates 
Be wraceie s 
pike N-1 
(109) LK a. oS Ni 01° 2.030405050; [P12 Pia (P36 Psz a P37 Pss) = P12 Ps (Pe Paz + Ps7 Pas) 
; . + Pre Ps (Psa P57 + Pss Paz) + Pr2 P17 (Psa Pe + P35 Pac) + Ps Pia (P26 Pr + Paz Pre) 
+ P1s Pts (P26 Par + 55 Pas) + - P13 Pis (Pos Psz + P25 Paz) + Piz P17 (P24 Pse + Pos Pas) 
+ P14 Ps (P25 Psz + Pas) + Pra P17 (P25 P36 + P26 P35) + Ps Pic (P24 Psz + Paz Psa) 
+ P15 Piz (Pea Pas Noe (N —1) {Piz » Ps (P16 Psz + Par Pos) 
- én P26 Psz + Par Ps) + Pre P17 (P24 Pss + P2s Psa)} l 
= eee 
Dera N-1 , 
(110) » 4... 2° a [(P2a + Pr2Pr4) (P26 Psz + Paz Pse) 


+ (P25 + P12 P15) (P36 Par + Ps7 Pas) + (P26 + Piz P16) (Psa Psz + Pas Par) 

+ (Pox + Piz P12) (Psa Pse + Pss Pas) + (P13 Pea + Pra P23) (P16 P57 + P17 Ps) 

+ (P13 P25 + Ps P23) (Pis Par + Piz Pas) + (Piz P26 + Pris Pes) (Pia Psr + P15 Paz) 

+ (Piz P27 + P17 Pes) (Pia Pse + P1s Pas) + (P14 Poe + Pie P24) (1s P32 + Pr7 Pas) 

+ (Pra Por + Prz Poa) (P15 Pas + P16 Pas) + (P15 P26 + Pre P25) (Pia Ps + Piz Psa) 
) + (N — 1) {(Pss + Piz Pis) (Pac Ps + Par Pos) 
) + (Pia pss + Prs Psa) (Pre Per + Pr Pos)}]- 


23) ( 

pa ( 
' + (P15 Por + P17 Pos) (P14 P36 + Pris Psa 
+ (Pra Pos + Prs Pra) ( 


Pie P37 + Piz Pse 


4—2 
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8 variates 


okecvee N 


(111) pw 
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} 


ce ee Nt 01 020304050507 08 [ (P13 P25 + P15 Pos) (P47 Pos + Pas Pez) 

F : + (p13 Pes + P16 Pos) (Paz Pss + Pas Psz) + (P13 Pa + P17 P2s) (P45 Pos + Pas Pss) 
+ (P13 Pes + fis P23) (Pas Per + Pas P57) + (Pia P25 + P15 P21) (P37 Pes + Pas Por) 
+ (P14 P26 + Pie P21) (P37 Pss + Pas Psz) + (Pra P27 + Prz Pes) (P35 Pos + se Pss) 
+ (Pra 2s + Pis Pos) (Pas Por + P36 Psz) + (P15 Par + P17 P25) (P36 Pas + Pss Pas) 
+ (P15 P2s + Pis P25) (Ps6 Par + Ps7 Pas) + (Ps P2z + P17 P2«) (P35 Pas + P3s Pas) 
+ (P16 P2s + Pis Pr) (Ps Paz + P37 Pas) + (N -1) (pis Prva + Pra Pes) (P57 Pes + Pss Pe) 
+ (P15 P26 + Pi P25) (Ps7 Pas 





+ Pg Par) + (P17 Pos + Pig Po7) (Pss Pas t Ps6 Pas) } | - 








FREQUENCY DISTRIBUTION OF AN INDEX WHEN BOTH 
THE COMPONENTS FOLLOW THE NORMAL LAW. 


sy A. S. MERRILL. 


WHEN two sets of related measurements are made, each of which follows the 
normal law, it is usually supposed that the index, the quotient of a value from one 
set divided by the corresponding value from the other set, will also be practically 
normally distributed. Thus cephalic indices in skull measurements are stated to 
follow the normal !aw very closely. In the sequel the conditions are shown under 
which an index distribution on the hypothesis of the normal distribution of 
components will itself be close to the normal, and in an approximate way, by 
graphs, the degree to which an index distribution will differ from normality in 
cases likely to arise in practice. 

Suppose y denotes the variate in one set as measured from its mean 7; and 
that # is the second variate measured from its mean % Then the raw measures are 
7+yand +a. Assuming that r is the coefficient of correlation between « and y, 
then the equation of the correlation surface is 

ee: (= 2ray v) 
zg=ne -1-—-r\o2 oc. oe? 
o, and o, being the standard deviations of # and y respectively. 
y+y 
te 


The index i 
may be expressed as a series, for 


aa y\ a\ 
l= (1+ (1 +2) ° 


uw 
, ; y 
Designating ° by Jy, we have 
: ~—- 


y x “yu a wy ee ay 
( eZ y x “LY Pi ag “ey Wr ay 


+=+ 
Summing and dividing by their number* we obtain the mean value of J : 
[=1, {1 +0, (v, — rv.) (1 + 302 + 150,! + 1052,")} = , 
* For the mean products used here and later see K. Pearson and A. W. Young, ‘‘On the Product 


Moments of Various Orders of the Normal Correlation Surface of Two Variates,” Biometrika, Vol. x11. 
(1918—19), pp. 86 ff. 
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Frequency Distribution of an Index 


where »v, and v, are one one-hundredth of the coefficients of variation* of # and y 
respectively, and vy; is used to denote the i-th moment about the original origin. 
It may be noted that in the process of summing for this and the following moments, 
the terms of odd degree on our hypothesis vanish. We shall drop all terms of 
degree above the eighth. The terms of degree nine will therefore contribute 
nothing, and will accordingly be omitted entirely in the expressions for the 
various powers of J. The value of v, or v, is usually rather less than ‘1 In 
omitting a 2,’ or v,°, we are omitting a quantity of the order (+1), sufficiently 
small to be negligible in most cases. 


3y multiplying we obtain the following results : 
P=1} fy = Qa 4 2y 1 ae Y 4ay + y re 4c’ + “7 _ Sey" 
rT w Y a “vy y H ral ts yg y ry 


5a* 8aty Baty? = 6a 1lOaty 4a%y? Ta® 
=f ex ae : + — ome aoe - ss é a 


ey ra x wy ry x 
12a*y 5aty? Sa? L4a%y 6a°y? Oa l6a7y Tay?) 
— ae tt ee Stee ee st Sh. 
ey ny iy ry ry? hag xy rey? | 


Peal? f as 3a . 3y + 627 Lp Day m 3y° 2 102" ‘ 18a*y e 9ay? 


{ z Y Mo “Ly : y x “ey “Ly? 
ye L5a" 30a" 18ay?  3ay*® 2145 dat 
47 . — at FG : Y im, < 4 Siaty 
y a wy wy ey’ @” “ry 
30a°y? — Ga*y® + 28a° O3a%y  45aty? 10a°y® = 36a" 
ry ak a wy ry a ra - x 


4 Ber y 63a%y* Lary? 45a? * LO8a"y ‘ S4asy? i 21 ary? ] 
ay ry xy r® 


ry weap BaP ) ri 


fi=—J4 f = 4a ‘ Ay ‘ area n l6xy + by? - — + 402°y 


{ a“ y a LY yy a “ey 


24a? + 4y' 4 3524 80a" y 


60a2y? 16a" 
Ar i] 4 3 * 4 _ —— 
uA Y Y wv “ry 


ey vy? 

ys 560° 140a'y = 120a%y? 40a2y* 

+! — + _ 
y* Wh wy ey + wey 


_ day" . 84a‘ _ 224a5y . 210a*y* _ 80a*y° + 10a*y' 


vy at ey a oe ae ey* 
2 120" * nea ~ — 140a*y? * 20a*y' 
x ay Ta) xray ryt 
+ —. - 480a7y + os = 224a*y? B5arty') 
a uy a ry xry* | . 


* The coefficient of variation is usually defined to be the per cent. of the standard deviation on the 
mean. 
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; an a4 F 72 DA» 2 Qe a3 VF 2 
pte es a il 
SS Y aS “vy y Dag ay 
5Oay? 10y* TOat 175a°y 150ay? 50ay>  5Byt 
ei AP tet A ik cn Oe 


Ly y a wy ey ny y* 
; 126a° 350aty 350a*y? 150a°y® Way? y® 221025 
— = + =" - 5 bast aN = +2 : 
- x y ey a y” vy y 34 
Ly a ry ak i he vy 
3302? =1050x* 12602°y? TO0Oaty® 175a%y* 15a%y? 
ee a ee eg eg ee 
'y ey ry’ “ay ry? 
495a8 1650a7y 2100a°5y? 1260a°y®  350aty* 35a%y> 
a vy ae Y + i i oe Ay) 


8 


7 6 772 75 pd rs vy} 3 455 ™ 
a vy ey z 'Yy ny a y” j 


630a°y 700e4y2 350ey? Thatyt Say? 
_ Y , TO0sHy* _ 350ary? | T5aty y 


Ff i Z 


Summing these results and taking mean values we obtain by aid of the higher 
product moments of the normal surface the higher moment coefficients of the index 
about the original axes: 

vy = 12 {1 + 30,2 —4rv,v. + v2 + 15v,4 — 24rv,Fv, + 3v,20.2 
+ 67r?v°v2+ 1052," — 180rv,'v, + 150,472 + 60r7v,40." 
+ 945v,8 — 1680rv,7v, + 105v,5v.2 + 630r70,5v."}, 
vy = 1? {1 + 602 — 9rv,v, + 302 + 45,4 — 90rv,? v, + 180202 
} + 36770202 — Irv, v3 + 4200,° — 945rv,5v, + 135,402 
+ 5407°v,4v.2 — 90rv,*v,? — 60r%v7v.2 + 4725v,8 — 11340rv,'v, 
+ 1260v,"v,2 + 7560rv,'v,2 — 945rv,50,2 — 1260r°v, v2}, 
vy = 1,4 {1+ 100, — 16rv,v, + 6v2 + 105v,4 — 2407rv,5v, 
+ 60v,2v.2 + 1207°v,202 — 48rv,v.? + 3v,t + 12602, 
— 3360rv,°v. + 6300,402 + 2520r7v,402 
’ — 720rv,*v.2 — 480r*v,v.? + 300,20. + 120r*v,2v," 
+ 17325v,8 — 50400rv,"v, + 75600,5v.2 + 45360r70,5v2 
— 10080rv,*v,* — 13440r*v,°v, + 315v,4v.4 
+ 2520r*v,4v.4 + 840r4v,40,4}, 
ve = I3 {1 + 1502 — 25rv,v. + 10v.2 + 210v,* — 525rv,'», 


+ 150v,2v.2 + 300r2v,2v.2 — 150rv,v, + 150.4 + 31500, 
— 9450rv,*v. + 21002,4v.2 + 84007°n,4v.2 — 3150rv, vs 
— 2100r*y,*v.? + 2250,20.4 + 900r2v,2v.4 — T5rv, v5 
+ 51975v,8 — 173250rv,7v, + 31500v,°v.2 + 1890007°2,°v,? 
— 56700rv,*v.' — 75600r*v,°v.5 + 31500, ‘0,4 + 25200r*0, "0, 


+ 8400r‘v,*v.4 = 1575rv,v.> —2 1007*v,*v."}. 


It is needful to take the I’s to a high power in the ’s, if we desire to obtain 
the £’s to a reasonable degree of approximation. 
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Transferring the origin to the mean of J we have for the first four moment 
coefficients: 
lo = Ve — v,? 
= I? {v2 — 2rv,v. + v2 + 8v,4 — 16rv,2v, + 80,202 
+ 57°v,?v,? + 690," — 138rvpPv, + 150,4v.2 + 54r7n,4v. 
+ 6960, — 1392rv,’v, + 105v,°v2 + 591r?v,5v,7}, 
fs = Vs, _ Bry) Vo, + 2y,* 
= I? \6v,4 — 18rv,°v, + 6,202 + 12r°v,2v.2 — 6rv,v2 
+ 1160,° — 348rv,'v, + 720,40. + 27677v,40.2 
— 72rv,*v,' — 4470.2 + 18720,8 — 5616rv,7v, 
+ 828 ,°v,? + 478877v,°v.2 — 828rv,5v.2 — 10447°v,50,"}, 
by = vf — 4y v5 + 6,20. — 3v,4 
= I,* {3v,4 — 12rv,v, + 6v,2v.2 + 12r2v,20.2 — 12rv, v2 + 304 
+ 120»,"— 480rv,°x, + 150v,"v.2 + 570r2v,42 
— 300rv,5v — 180r*v,*v,? + 300,20. + 907r?v,2004 
+ 30540,* — 12216rv,7v, + 15588rv,"v2 + 2736v,°v2 
— 67447*v,°v,? — 547 2rv,5v2 + 2106r2n,4v.4 
+ 633rv,4v,4 + 3150,4v1, 
Bs = Vs — 5v,'vy + 10v,v,/ — 10v,/*v.' + 40," 
= [,° {60v,° — 300rv,°v, + 120v,4v.2 + 4807r20,4v.2 — 3860rv,3v.3 
— 240r*v,'v,3 + 600,20. + 24077n,2v.! — 60rv, v3 
+ 2960v,8 — 14800rv,7, + 25260r2v,"v,2 + 43400,%r,2 
— 16580r*v, v2 — 13020rv,*v.2 + 3160rv,4v,4 
+ 10260rv,*v,! + 13800,4v,4— 1380rv,"v, — 1580r"v,20,"I. 
These expressions assume a more compact form by the introduction of two 
auxiliary quantities, p. and i, defined by the following equations : 
Do = V2 — 2rv, V2 + V2, ; 
bees Vv, — 12 Ua M, 
VV? — 2rv,V.+ v2 V po 
thus the sign of X V De is defined by that of v, — rv. 
Since V7 — 27, 0. + v2 = (¥, — Tv. + v2 (1 — 7°), 
Po = pz + v2? (1 — 7°), 
hence vo" (1 — 1?) = p. (1 — A’). 
The expression p, is of the second order in v, or v; X is of order zero. 


Substituting we have 


pg = 1? po {1 + 0,7 (5d? + 8) + vy (5402 + 15) + 0° (59102 + 105)}, 
p= 1, py? v,d {6 + 4v,2 (112 + 18) + v4 (104402 + 828)}, 


pw, = 1,¢p2 {3 + 30v,2 (Bd? + 1) +o, (633A! + 2106X2 + 315)}, 
p= 1,p.2 v,r {60 + v2 (1580X2 + 1380)}. 
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Accordingly expanding in powers of »,: 


B, = = vf? {36 — v,2 (12d? — 540) + 4,4 (— 14604 + 45002 + 1917)}, 


) 


B, ="! =3 43 {v2 (20r2 + 4) + v4 (— 144 + 40402 + 42)}, 
By =** =10v,d {6 + 12 (83d + 93)}. 
Be" 
These functions are thus dependent only upon v, and X. As mentioned above, 

v, is a coefficient of variation. Its value will usually lie between 0°01 and 0-1, 
and in the case of anthropometric measurements (weight, of course, excluded) is 
found most commonly between 0°02 and 0°04. The expression for ’ shows that this 
function of v,, v, and 7 is always less than unity in numerical value. Its nature 
can readily be seen by considering, the distribution of the J if approximations to 
the first order only in v, or v, are considered. In this case 


= 1,(1 +e~3) and J=I,. 


I x 
Then =9(5-14+ ) 
y=; “4 


Substituting this value for y in the equation of the norma! correlation surface, 


we have 





; p. + a I ei 7 
Z=>2e v7 (1 - r°) {o," o1 Po Po | 
I 4 a 
Let o> F-— 1 = J — ° 
I Vi Hp} 


2 Dw» mp2 
mm ; 2 Zry . a 
Chen o? = J jo nae ie 
y LY a 
and ae ee Se ee ee 
ant On = Vg VV Vo + UY = V Po- 


Thus we have as the correlation surface 

_ ( x? 2rtw ow ) 
m , S782 \o.2" oo ' Tos 
Z Ze 1 071° Cw 


showing that 2 is, with reversed sign, the coefficient of correlation of x and @. 


In Table I are given the values of 8,, 8. and 8,’ for ?=90, *05, *1, *2, 3, “4, °5, 6, °7, 
‘8, °9, 1, and for v, =°01, ‘04, (07, 0-1. The signs are given as if X were positive, but 
since a change of sign of only changes the signs of yw, and ys, the table suffices 
also for negative values of X. The value of X? will seldom be as great as ‘5, which 
corresponds to a correlation coefficient (of « and @ as shown above) of over ‘70. 
Coefficients of variation greater than about ‘04 are also unusual. In the portion of 
the table covering values likely to occur, it may be observed that 8, is always 
small and £, differs but little from 3. For the larger values of both v, and 2’, this 
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is not the case. The value of 8, shows a rather greater deviation from zero than 
might perhaps have been expected. In the smaller values, however, it is not 
sufficiently large to make the distribution differ to any considerable degree from 
normal, 

This table can be used in the following manner in practical work. Values of 
v,, ¥, and r having been found in two sets of measurements, \* can be readily 
computed. From this value and that of v,, the approximate values of 8,, 8, and 8,’ 
can be obtained to a fair degree of accuracy by interpolation. 

The late Professor Thiele was, I believe, the first to suggest an expansion for 
frequency distributions in a series of differentials of the ordinates of the normal 
curve. Professor Pearson has given* an expansion in tetrachoric functions for any 
function, in the form 

Zz 4 , 

Wa =" * jag + F569 
in which the tetrachoric functions differ from Thiele’s functions only by a numerical 
factor. Thus if I is the variable, the argument is (J- I)/c, where o is the 
standard deviation of I and 7, is defined by the equation s 

r—-f\_ 1 d ae | -3(& 9)" 
r.(— )- e 
o \ 


“4 a) VQr0 
oC 


Table II has accordingly been formed showing the values of 


_ _ vo 
Z (B.— 3) 7; + =a i —10 VB) t.+..., 
V720 


‘,. * 6 | 
(B.—3) and eee | 
vag 8» i398 7720 (F — 10 VB;) 


for the above listed values of \2 and v,. This table shows that the values of 8,’ in 
the more useful part of the table will not shift the distribution much from normal, 
a fact which was not apparent from mere inspection of Table I. It further shows 
that for higher values of )? and v,, the fourth coefficient in the tetrachoric expan- 
sion is commensurable with the third. Since 7, is, in general+, about 4/5 as great 
as T;,a not wholly satisfactory result, to be shown graphically in the sequel, may 
be anticipated for such values. 
For constructing graphs to illustrate possible cases, the procedure was as 
follows: In Table II above the values of the coefficients of the 7’s of the tetra- 
chorie expansion were found for the values of \? and v, to be used. These were then 
multiplied by the values of the respective 7’s as given in Dr Lee’s tables for the 
values of h selected. The argument / is measured with the standard deviation as 

a unit. This process gave values of 

ae si V BT, 
5 


= = Be, 
Ya= 799 Ps — 3) 


6 
Yo = — / aoa lR ; 
Ys= F759 (Bs — 10 VB) 7. 


* “The Fifteen Constant Bivariate Frequency Surface,” Biometrika, Vol. xvi1. (1925), pp. 268 ff. 
+ See Dr Alice Lee’s ‘‘ Table of Tetrachoric Functions,” Biometrika, Vol. xvi. (1925), pp. 351—354. 
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The sum of these and 
Yo=T7, (the normal curve) 


gave the ordinates for the tetrachoric expansion (to the degree of approximation 
used). 

In the figures are shown the curves of y,, y, and y; (when possible) as well as 
y and the composite curve. 


Figure 1 shows the graph for v,=‘01 and \¥=°316(A?="'1). The +, and 7, 
contributions were so small that they couid not be shown upon the graph. The 7, 
contribution was also small, so the normal curve is a remarkably good fit for the 
curve of distribution. 


Figure 2 shows the various curves for v, = ‘04, \ = °548 (A? ="3). The 7, and 7, 
contributions are again small, but are shown. The curve of distribution still follows 
very closely the normal curve. Cases will seldom arise in practice in which the 
deviation from the normal curve will be much greater than that of Figure 2. 

Figure 3, for v,="1 and ) = 1, proves of little interest from a practical point 
of view, but illustrates well the unsatisfactory nature of the tetrachoric expansion *. 
Theoretically the case of \2=1 can arise only if >= +1, a condition of affairs 
which is seldom approached in practice. 

From the figure it is evident that the 7,, 7; and 7, terms of the tetrachoric 
expansion all add considerably to the composite curve. The 7; contribution is 
somewhat smaller than the 7, contribution, but the contribution of 7, is not less 
than two-thirds of 7, at its maximum value. All three contributions are of 
approximately the same order. We can therefore only speculate as to whether the 
tT, and 7, terms might not bring further considerable changes to the composite 
curve. The convergence of the tetrachoric expansion is here too slow to justify the 
degree of approximation we have used, and there is no evidence that if the great 
labour of computing the coefiicients of rt, and 7, were undertaken we should be any 
nearer the final goal. 

It seems, then, that for conditions ordinarily met with in practice, the frequency 
distribution of the index, when both components follow closely the normal law, is 
sensibly normal. In other cases, however, particularly in cases where the corre- 
lation is high and the coefficients of variation are large, there may be a considerable 
deviation from normality. 

[A further paper will shortly be published showing that comparatively slight 
deviations from the bi-variate normal distribution of # and y lead to marked 
divergence from normality in the index distribution, and that this is unfortunately 
the case in a number of anthropometric measurements. K.P.] 


* Pointed out by Professor Pearson, Biometrika, Vol. xvtt. (1925), p. 277. 











ON THE EXAMINATION OF FINAL DIGITS BY 
EXPERIMENTS IN ARTIFICIAL SAMPLING. 


By C. P. SUN. 


In a recent paper* by Mr G. U. Yule some interesting data are given of the 
influence of personal equation in scale readings. That such personal peculiarities 
do occur, and sometimes occur with such intensity as to render useless long series 
of data, has been for many years a familiar fact to those whose work lies in 
anthropometry. On the other hand experience shows that inequality in the 
nunibers of final digits may still exist even when the observer is highly trained 
and is reading to the nearest scale reading only, without estimating tenths of such 
a reading t. 

In many measurements in anthropometry and biometry the readings can only 
be taken to a unit, say a millimetre, and the standard deviation may well be only 
a few millimetres. Grouping by millimetres will not give us more than 15 to 25 
frequency classes, and it depends largely on where the scale of digits crosses these 
classes, which digits shall predominate. Mr Yule has referred to this matter 
in the paper cited above, p. 572, but as it is known to be a source of inequality 
in final digits, it is, perhaps, worth while noting what its effects may be experi- 
mentally. Let us take for example the upper face height on the skull measured 
in millimetres; the standard deviation is 4*mm. roughly. With a thousand in- 
dividuals there would not be more than 20 to 25 groups; with a small cranial 
series probably only 15. With series of 15 to 17 crania such as those which were 
measured by Dr Morant in Biometrika, Vol. x1v., which Mr Yule cites,even many 
of these 15 to 20 groups will have no representatives at all and this produces 
further irregularity in the distribution of digits when they are obtained from 
measurements on short series. 

Generally the mere fact of random sampling produces irregularities, often very 
considerable in the case of small samples, in the distribution of final digits, when 
the final digit is the unit of measurement and grouping. 

That this inequality may be evened out if enough series of adequate length be 
taken—it does even out the more quickly, if our frequency distributions are 
symmetrical curves, and there are enough of them—I do not deny. And, as a 


* “ On Reading a Scale,” Journal of the R. Statistical Society, Vol. xc. pp. 570—587. 

+ In dealing with Fawcett’s and Macdonell’s cranial measurements, it is difficult to see what 
Mr Yule has done, because those measurements were for lengths and circumferences taken to the 
nearest tenth of a millimetre by appreciation, and then this appreciation was used to class to the 
nearest millimetre, or in the case of some measurements to the nearest half-millimetre. Hence such 
deviations from equality of final digits are, we believe, due to the causes discussed in this paper. 
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matter of fact, I have treated this topic separately in two parts where I sought an 
explanation for the existing inequality and endeavoured to ascertain the extent 
to which symmetry and many groupings affect the frequency distributions of the 
final digits. 

Part I. 

If the series be relatively short, if some of the series be not symmetrical, but 
skew, if we allow for the usual sampling errors, will the result show even approxi- 
mately an equality of final digits for such series and such units and such standard 
deviations, relative to the groupings by reading units, which we meet with 
in craniometry more especially, and not infrequently in anthropology ? 

This is the question upon which it seemed desirable to throw some light 
by aid of experimental sampling. 

For this purpose Tippett’s Random Sampling Numbers* were used and five 
normal curve frequency distributions were obtained by the method indicated in 
the “Foreword” to that work. In precisely the same manner the frequency distri- 
butions for five skew curves were obtained by taking from the Tables of the 
Incomplete V-Functions as the sampled populations five distributions with different 
values of p. We have thus 10 distributions, with random variations from smooth 
continuous frequency distributions, partly symmetrical and partly skew, such as 
might easily arise in anthropometric practice. The normal curve frequencies were 
grouped in (a) 30 subranges, (b) 21 subranges, (c) 15 subranges, (d) 14 subranges, 
and (e) 10 subranges, and the corresponding numbers of individuals were (a) 300 ; 
(b), 152; (c), 300; (d), 152; (e), 300. 

For the five skew curves the number of subranges were: (a’), 21; (b’), 16; 
(c’), 15; (d’), 12; (e’), 11 respectively; and the number of individuals in each 
frequency: (a’), 152; (b’), 200; (c’), 152; (d’), 200; and (e’), 152 respectively. 

Such distributions might quite well represent millimetre measurements of 
ten characters on the skull, although the numbers are large for cranial series. 
Ten cards were now taken with the ten digits 0, 1,...9 inscribed on them, and 
drawn one at a time and replaced in the bag. Each draw settled the digit with 
which the frequency distribution was to start. The frequency of the final digits 

was then found to be given by Table I on p. 66. 

Any one examining the totals column would assert that the observer had a 
marked prejudice in favour of the first five digits, 0—4. Yet the whole distri- 
bution is the result of random sampling from continuous symmetrical and asym- 
metrical distributions and of placing the scale down on the marginal frequencies 
by chance! If it be said that the number of subranges is too small, we may cite 
cases like the measurement of stature; no wise anthropometrician would record 
it to more than the nearest inch, because stature is capable of varying, possibly 
1 inch, according to the time of day at which it is measured. Thus, 1000 cases will 
only provide about 18 groups, and 200 possibly not more than 14 or 15. Forearm 
in adult women tabled to the half-inch, which is as close as can be reasonably 

* Tracts for Computers, No. xv. Cambridge University Press. 
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ascertained, only gives 14 groups for 1000 cases, and for 200 cases might easily 
give only 10*. When we come to short cranial series, some of which Mr Yule makes 


TABLE LI. 





— (a) (b) | (c) (d) | (e) (a’) (b’) (c’) | (d’) (e’) Totals 
OE eR a ta 
0 40 13 | 36 13 17 11 39 a | 7 18 239 
1 38 17 | A7 13 56 LO 26 31 13 12 293 
} 24 12 oS) 19 72 17 24 22 33 6 274 
} 28 17 12 21 63 Ld 17 18 24 10 255 
4 25 22 14 21 11 23 19 10 25 5 235 
H 29 20 20 18 32 23 9 15 i) 6 177 
6 29 13 24 14 9 15 6 8 | 11 10 139 
7 28 16 17 15 l 14 12 | 13 3 153 
8 28 12 1 12 3 9 19 9 8 30 141 
9 3 10 14 6 6 15 29 LO Li 22 154 
= | — = a | 

N 300 152 | 300 152 300 152 200 152. | 200 152 2060 





use of with only 15 to 17 individuals, it is clear that we shall get down to 
10 groupings, or the position of the scale against the distribution curve of the 
sampled population and the large variation of the random sampling for small 
numbers are factors of great importance in determining the distribution of final 
digits. Table II gives our results compared with the hypothesis of an even 
distribution of final digits. The coefficient of variation is 27°284°/,, and the 
x’ = 153°354; for such a value of x? and for n’ = 10 groups, it is generous to put 

P = 000,000 ; 
in other words, the result is practically impossible on the basis of random sampling 
from equally probable final digits. 

TABLE II. 











Final Observed | Theoretical age / | ede 
Digit Beppe F sat’ Seen f'-f fv -JP | W-T¥e 
< | 
0 239 206 33 1089 5286 
1 293 | 206 87 7569 36°742 
2 274 206 68 41624 22°446 
3 255 206 49 2401 | 11°655 
| 4 235 206 29 841 1-082 
5 | 177 206 29 841 | t° O82 
| 6 139 206 —67 1489 21°791 
. 4 153 206 —§9 2809 13°635 
| 8 141 206 —65 41225 20°509 
| 9 154 206 — 52 2704 13°126 
| 
Totals 2060 2060 o=56'21 | y?=153°354 
| 


* The temperature of normal individuals can hardly be read to advantage closer than 0°1°, and 
100 normal cases will only give 10 groups. 
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) In the above investigation we have taken rather long series for craniometric 
| measurements, 150 to 300, thus reducing the variations due to random sampling, 
but this example seems to suggest that one must be very cautious in assuming 
that every case in which the final digits of an observer are very unequal and give 
an impossible “ P” is due to personal predilections. The illustration may also 
account for deficiencies or excesses of certain digits which can hardiy be the 

subject of prejudice at all. 

Part II. 

The first part of this investigation has demonstrated the distribution of final 
digits as they may occur in craniometrical and anthropological works with relatively 
short series and when each group corresponds to a digit; and the present part 

’ ‘ ° on pes . . 
attempts to show what changes it would make, if we confined our attention entirely 
to those distributions which have a fairly large number of frequency classes. In 
precisely the same manner as before, three normal curves and three skew curves 
not identical with the previous ones given in Part I were obtained. The number 
of subranges for the normal curves is: (a), 23; (b), 28; (c), 27; for the skew curves: 
(a’), 26; (b’), 24 and (c’), 25 respectively. 

Such distributions might represent anthropological units. Yet the distribution 
of the final digits res:lting from mere chance, and without personal predilection, 
is shown by the following table : 

i TABLE IIL. 

) “Final |. ta ia, ae Tr) = - 

Sina | , 

Digit (d) (c) (a’) (b') | (e’) | Totals | 
—— ‘is ‘ei A ae | | } 
0 23 20 10 10 a | ed Se 
I 17 | 24, 12 | 22 | 19 | 9 | 103 | 

4 14 23 LO 16 16 | 24 103 

9 16 14 19 21 13 92 
hb 15 16 9 12 12 l4 | 78 | 
5 16 14 16 3 15 15 | 99 

; 6 13 13 16 0) 20 ll | 938 
IS 26 ll 1035 i3 | 13 | 
8 18 28 8 10 24 | 15 | 103 
9 7 20 19 8 15 5 74 | 
Beh 
Vv |150 | 200 | 125 |150 | 200 | 125 | 950 | 
| 





Examining superficially the data, one would say that in a distribution like 
this, there was a special predilection for 1, 2, 5, 7, 8; indifference toward 0, 8, 6; 
and unusual prejudice against the digits 4 and 9, This we hold has no basis at 
all, but is a matter of pure accident. Here again we meet with inequality of final 
digits, even with more groupings and far greater frequencies than are usual in 
craniometrical measurements. 

We may again compare this distribution by aid of the (P, x’) test with the even 
occurrence hypothesis. The value of y? is 13°4943, which gives a P value of ‘1427, 


5—2 
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Relatively speaking, it is more probable than the results recorded in Part I, but 
it is by no means markedly significant. Since we suspect skewness must have 
played an important réle in changing the predominancy of certain digits, the x's 
for the symmetrical and the asymmetrical distributions were calculated inde- 
pendently; and they are 6788 and 16979 for the normal and skew curves 
respectively. These give the corresponding P values of ‘6589 and ‘0491, the 
probability of occurrence being quite good for the normal curves, thus indicating 
concisely what symmetry can do in increasing the chances of even occurrences. 

As the starting of the digits is perfectly arbitrary, we might have summed up 
all the observed frequencies both in Part I and Part II, maintaining the same 
serial order, thus: 

276 232 266 244 228 331 396 377 347 313 

4 i) 6 7 8 9 0 1 2 

when it would have been explained as a case of prejudice in favour of the whole 
number, or its immediate neighbourhood, if our last digit was a decimal figure. 
Accordingly, while fully realising in how many cases careless or prejudiced readings 
are an important factor in the inequality of the numbers of final digits, we believe 
that other factors come into play which even the most careful and physically trained 
measurers cannot avoid, and that Mr Yule, while recognising their existence, has 
possibly not sufficiently emphasised them, This emphasis is provided by the above 


experimental sampling, which is not far removed from cases actually occurring in 


anthropometry, and in particular craniometry, although in the latter case the total 
frequencies must be considered as exaggerated. 

































SIMILARITIES AND DIFFERENCES IN THE 
OF THE MULBERRY SILK-WORM®*. 


SPECIES 


t By Proressor A. V. ANOOCHIN, Moscow. 


SILK-worM breeders and dealers in raw silk have for long been greatly 
interested in the possibility of a comparison of the different species of the mulberry 
silk-worm, the aim being to find owt which species is the most profitable and to 
what relative extent. 

Silk-worm breeders approach this question quite simply. The species are com- 
pared on the basis of a large amount of data and then conclusions are drawn. 

Here is an example of such an investigation. Ivanoff+, comparing the qualities 
of the Baghdad species with others, writes much as follows: 

In its percentage contents of silk in cocoons the Baghdad variety stands 
slightly below the European species, occupying the fourth place, as seen from the 
following tables which represent the standard numbers obtained in the last ten years 
in the laboratory of the Caucasian Silk- worm Breeding Research Station. 


TABLE I. 


Percentage Contents of Silk in Cu: oons. 


) Silk 
1 Chinese White ........<... 36—35 
2 European ,, : o4 
S PSAPANCHC  ovccewssecsceccese ¢ -33 
ees 32°5 
5 Middle-Asiatic 





6 PRUNUMAR. saucsuvedwess 
i, - SORIIRD oc nase eeeceiaccce 26 


In all other respects it is either superior to European species or almost equals 





' them, which is shown in the following tables: 
TABLE Ila. TABLE Id. 
Length of Thread in a Cocoon Strength of Thread (expressed by 
(the averages of ten years’ weight wn grams of Silk-thread 
research), of a given Length). 
metres grams 
D.  & ondivcscassioes 866 ie, oS Ree en 38°0 
SS es 724 2 Middle-Asiatic ......... 36:0 
8 Middle-Asiatic 723 ee ee 35°8 
Br Ro ctnsancactetthstxas 717 , Japanese........... . 351 
eS ee 639 a .. 33 
i iS UNOS <sonccccesvadeeeas 580 * 6. Bagndad .........ssosscess 33-4 
€° MOBBONG, ..3'ss0cisesscceesven: O80 Y FEUTONCAN a cacceseseesses 33°3 


* Memoir prepared in the Central Silk Experimental Station, Moscow; Director, B. Michin. 
+ V. P. Ivanoff, ‘On the question of the Baghdad Species and its Replacement by European and 
Crossed Species.” News of the Experimental Silk Station, 1915, Tiflis, 
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TABLE IIc. 
Elasticity. 
(Extensibility 
European 
Middle- Asiatic 
Baghdad 
Persian 
Caucasian 
Japanese 
Chinese 


Thus far Ivanoff. 


in Species 





Amongst the Moscow manufacturers one hears the opinion that the thickness 


of Mulberry Silk-Worm 





of the silk-thread of the Baghdad variety is very uneven, so that it is impossible 


to spin it into an even thread. 


To prove this statement I mention the results of examination of three species: 
Ascoli Yellow from the firm Paskvaliss, Crossed Spherical—firm Paskvaliss and 
Baghdad White—firm Pinachi. The cocoons were obtained from the Caucasian 
breeders and were of their own breeding and from the same district. 


Length in 
Hundreds of Metres. 


of the separate silk-threads of the Baghdad variety is greater than that of the first 


Weight of each h undred Metres expressed in Milligrams. 


a. Askoli Yellow. 


Cocoon 


For each variety five average cocoons were taken; the thickness of the thread 
is determined by the weight of each hundred metres expressed in milligrams. 





— 


1 B ( D 
30 3D 35 35 
3D 10 35 10 
35 10 35 35 
30 35 30 35 
20 25 25 25 

20 25 20 
20 





two varieties, which are considered to be the best Italian. 


; 
} 
TABLE IlId. 
Thickness of Thread. 
RRO ioc ee cuccoweceese 137 
2 Caucasian ........... .. 139 
S CRDANGR,....05.0060006500 149 
4 Middle-Asiatic ......... 153 { 
BB codecs deacsentsts 159 
G European  ......:.......: 172 
Ff TUIED c cvnesicovesceaces 173 
. ° . . ° ? 
Laboratory investigations do not confirm this. 
TABLE III. 
' 
Crossed Spherical. c. Baghdad White. } 
Cocoon Cocoon 
I ( D k I B C E 
10 30 10 25 LD 3D 10 35 
30 35 10 3D 10 35 10 30 
30 30 30 3D 10 30 35 35 
25 25 25 20 40 30 35 35 
25 15 30 35 25 30 35 
25 35 25 25 30 
95 | 20 20 30 
20 20 25 


From these facts we can by no means conclude that the variation in weight 


What is the general deduction from this? From all these facts one concludes 


that in its technical qualities the silk of the Baghdad species stands by no means 
so low as some Moscow manufacturers maintain, 


The conclusion is quite unexpected. In fact in length of thread it holds the 
first place amongst other species, in elasticity the third, in percentage contents of 
silk the fourth, in strength of thread the sixth, in thickness the seventh, Are then 
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its qualities superior or inferior to those of other species? One does not know. 
In one quality it is better, in another worse, and it is impossible to draw a final 
conclusion for two reasons : 

The first lies in the fact that one cannot compare species according to separate 
characters. One always obtains the result which is so well expressed in the 
following words of Serebrovski: 

“On account of transgressive variation it is impossible to select a character 
which would serve as an absolute criterion for the distinction of one species from 
another. According to one of its characters the individual resembles more closely 
one definite species, according to another it may resemble more closely a second 
species *,” 

The second reason we have already discussed in our work, “The Changeability 
of Suk-Thread+,” wherein it was said that in order to find the standard characters 
which define a species it is necessary to examine the greatest possible number of 
individuals, With a few individuals we may commit great mistakes, since every 
character forms a variational series and accidentally selected individuals may 
either correspond to the standard type, or may diverge from it on the positive or 
negative side. Therefore such individuals should not be compared with each other, 
especially in the case where they come from different species. Our results}, from 
the comparison of the white Baghdad variety with the Askoli and Chinese-golden, 
bring us to quite a different conclusion, since we find that, according to all 
characters, except thinness of thread, the Chinese variety occupies the third place. 
TABLE IV. 


| 


SF . Weight of Length of Thinness of er 
| Frison* | ‘Thread Thread Thread | Dassinett 
| Se Sees aren = Een) ait i a 
| 
Baghdad ... Sats 11°93 253°43 757°58 “34 31°79 
Askoli bee ee 35°84 | 205°52 781°27 if | 16°40 
Chinese golden oe 19°55) | 187°66 607 °26 a | | 10°81 





* The flock silk or curly which must be taken off before the end of the silk filament for reeling can 
be found. 

+ When the reeling is finished the pupa remains enveloped in a silk enclosure, the ‘‘husk,” which 
is not reeled off and is called the bassinet. The bassinet varies with the breed of silk-worm in 
weight. 

In spite of these apparent indications of the possibility of comparison between 
the three mentioned species, we still cannot draw any conclusions, since on the 
one hand the Baghdad variety occupies the second place in length of thread, while 
on the other, if we increase the number of species, we shall become confused in 
the estimation of their qualities. 


* Serebrovski, ‘‘ The Statistical Method in Biology,” in ‘‘ The Statistical Method in scientific in- 
vestigation.” Academy, 1926. 
+ A. V. Anoochin. Central Silk Research Station, 1927. 
t+ A. V. Anoochin, ‘“‘ The Changeability of Silk-Thread.” Central Silk Research Station, 1927, 
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The only proper method is the one based upon the laws of large numbers. In 
which case also the degree of similarity ought to be expressed by a single coefficient 
embracing all the most important characters of the two species under examination. 
Such a coefficient would furnish the means of finding the degree of likeness of two 
or more species, which approach each other nearly or remotely, and of calculating 
by how much the degree of likeness of any two exceeds the degrees of the others. 


The first attempt of this kind gave brilliant resuits, and was that of Heincke. 
In his monograph, The Natural History of the Herring, Heincke put before himself 
a whole series of questions, one of which was the following: Does the herring of the 
northern European seas represent one species or is it a group of local species 
which can be clearly distinguished from each other? A question extremely difficult 
to solve since these herrings possess very similar features, and although they are 
subdivided into species these latter approach each other very closely. Heincke 
applied the method of variational statistics and set himself to study the variations 
in senarate characters in the individual herrings; he performed 100,000 measure- 
ments and found from these that the characters of the individual herring vary 
according to the law of error and give a typical curve. Of such characters there 
were 60. If one compares the curves of the characters of individual herrings with 
those of the supposed species it becomes apparent that each species has its own 
average value for every character. It follows that for the definition of a species it 
is sufficient to take the combination of all standard (average) values of the different 
characters. When this was done one obtained the expected result, i.e. the greater 
the geographical distance between the species the greater are the differences be- 
tween them. 


The next task was still more difficult—that was to find a means of distinguishing 
one species from another. By aid of only one character it is impossible to achieve 
this, as explained above. It is necessary to bear in mind the combination of all 
characters in order to do it. Heincke’s method was the following. He found that every 
variational series is characterised by the fact that the sum of the squares of devia- 
tions from the average value is minimal, i.e. if we compare the sum of the squares 
of deviations from the average value with the sum of the squares of some other 
number, the former will always be the smaller. From this it follows that the 
animal in question will belong to that species, with respect to whose average values 
the sum of the squares of the deviations of all its characters will be minimal. 

Thus, if we calculate the sum of the squares of the deviations of a certain 
individual from the average values of the same characters of all known herring 
species, the smallest number will indicate to what species the individual belongs. 


If, therefore, at the end of the calculation the following results be obtained : 


for the White Sea Herring. . , : ‘ 3'213 
» » Spring Norwegian Herring . ; ; 3°696 
» » Jutland Herring . ; : : - 6317 


it means that the individual in question belongs to the White Sea variety, 
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This method of Heincke has not met with wide application, apparently for the 
reason that it involves an enormous amount of measurement and calculation. 
We also hesitated to follow it and waited for help from the biometric-mathema- 
ticians to simplify it. At the same time we regard the method devised by 
Heincke as the most correct* ; the problem is to diminish the amount of measure- 
ment and caleulation. 


In the search for such a coefficient we met with the work of Pearson, “On the 
Coefficient of Racial Likeness,” Biometrika, Vol. xvut. Parts I and 11, 1926, p. 105. 
This coefficient has been proposed by him for craniologists and just answers our 
requirements. 


“Every craniologist and indeed every physical anthropologist has come up 
against the difficulty of comparing two races of which it is only possible to secure 
a limited number of individuals of one or other or both races. Not unnaturally he 
is driven under the circumstances to seek help by measuring a large number of 
characters in order to compensate for few individuals. We have frequently to 
admit that relatively few individuals are available in many anthropometric in- 
quiries, and that we really must compensate for the smallness of our sample by 
the largeness of our character series.” 


“But how is this to be done? We can compare the means for our two small 
groups character by character, and if we are trained statisticians we shall compare 
these mean differences with their standard deviations. But when a considerable 
number of the characters do not show differences markedly significant with regard 
to their probable errors, we are left in considerable doubt as to what inference may 
be safely drawn from the whole series. We need a single numerical measure of 
the whole system of differences, something which will express by a single coefficient 
the measure of resemblance (or divergence) of the two races or groups. Such a 
measure or coefficient I term a Coefficient of Racial Iikeness (C.R.L.). It should 
be a measure, not of how far the two races or tribes are alike or divergent, but of 
how far on the given data we can assert significant resemblance or divergence.” 


Pearson’s Coefficient of Racial Likeness has the following numerical expression : 


YY p Al (m,— my)? | A td mf 
=: % .=S —— Se a ° = 
C.R.1 Ul|<o:o, }} 1+ ay + 67449 MV 
“ee, 
n nm | 


ms; and my are the averages of the same characters of two species, o; and oy 
their standard deviations, » and »’ the number of individuals, M the number of 
characters and S the sum of values in the large brackets for the whole number of 
characters. The ideal value of M is, according to Pearson, 40—50, but in practice 
24—25 is quite sufficient for small values of », n’. In any case the greater the 
values for n, n’, the smaller the error, and the more correct the coefficient. 


* [I venture to think the method essentially erroneous. It could not be applied to characters 
measured in different units and it entirely neglects the differing variabilities of the several characters 
which are therefore improperly weighted, the larger characters being weighted with their size. K. P.] 
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We calculated the C.R. L. for cocoons of the Baghdad white, Askoli and Chinese- 
golden variety on the basis of eight characters (longitudinal diameter, transverse 
diameter, weight of complete cocoon, weight of the cocoon-envelope, weight of frison, 
weight of bassinet, weight of silk-thread and length of silk-thread). The small 


number of these was compensated by the great number of individuals, 250 cocoons, 


As a result we obtain the following numbers: 
TABLE V. 


Coefficient of Racial Likeness. 


: , : | 
Baghdad Askoli Chinese-golden | 
Baghdad ... ane 396°151L+°3387 «| | =565°2304°337 | 
Askoli ... as 396°151 + °337 9°792 + °380* 283°256 + °337 
Chinese-golden ... 565°230 + °337 283°256 + °337 — 


One group of Askoli from Astrachan, the other from Moscow: see below, p. 70. 


Great numerical values of the coefficient made us mistrust its practical use, 
the more so, since when applied by craniologists its numerical expression never 
grows beyond two digits, even in the case when the difference between the 
characters of the compared skulls is very great. Besides this, large numerical 
expressions are inconvenientt. 


For the sake of verification we introduced a ninth character, the thinness of 


thread. As a result of this enlargement the numerical value of the coefficient at 
once diminished considerably. 


TABLE VI. 


| 

Baghdad | Askoli | Chinese-golden 

| | 

a : 

| | | 
Baghdad ... ai } 374°023+4°318 512°433 + 318 
Askoli... ... | 374°023+°318 257 °709 + 318 

| Chinese-golden ... 512°433 + °318 57°7O9+°315 | 


The decrease of coefficient indicates that with a greater number of characters 
its numerical expression will become smaller and more correct, since parallel with 
this the amount of error decreases, depending entirely on the number of characters 
considered. 


+ [Similar large values have been obtained when the coefficient is applied to measurements on the 
living of different races. Even allowing for the great differences of technique in such measurements, the 
high values seem to indicate that racial differences lie to a greater extent in the fleshy parts than in 
the skeletal frame of Man. K. P.] 
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Pearson considers 40—50 examinations as ideal and 24—25 as sufficient. But 
we have only considered nine characters, therefore we obtained a greater numerical 
value for the coefficient. Pearson points out that in the case where one selects “ the 
most diverging or the nearest alike characters ” for two species, the amount of these 
does not affect the C. R.L. 


In order to verify that the coefficient really does show up the likeness or 
divergence of species, we calculated it for two groups of cocoons of the Askoli 
variety (each group 250 cocoons). One group of cocoons was obtained from 
Astrachan dating from 1922, the other dating from 1926 was received from the 
Central Silk-worm Breeding Station (Moscow). The following six characters were 
selected for comparison: the longitudinal and transverse diameters, the diameters 
of the two bulgings on the cocoon, the complete weight of a cocoon and the weight 
of the envelope. The coefficient was equal to 9°792 + °380. 


According to Pearson the coefficient fluctuates round zero with the probable 
[2 ; 
error of 67449 , TE when both groups have been taken from the same species. 


Our C.R.L. is greater than zero, apparently because only a few characters were 
considered. But it differs so definitely from the C.R.L. of two different species, 
that one can recognise its use for judging likenesses and differences between the 
species of the mulberry silk-worm as a proved matter*. 

In his work, Pearson points out that “the greater the number of skulls the 
greater the coefficient,’ and before that, “I next proceeded to consider what in- 
fluence the number of individual crania dealt with had on the coefficient. Con- 
sidering only coefficients based on all characters (i.e. absolute size and indices and 
angles), I obtained a correlation table of 580 entries, entering each coefficient twice, 
once for each race dealt with. The table was still more ‘lumpy’ than the previous 
one, as the skull frequencies run from 6 to 885. The answer I found for the Corre- 
lation of the C.R.L. and Number of Crania used was 

r=+'1270 + 0276, 


i.e, there was a not very important correlation between the number of skulls used 

and the coefficient, the greater the number of skulls the larger the coefticient+.” 
We have obtained such high values for our coefficients, since we chose a fairly 

large number of individuals, 250 cocoons. The diminution in the number of in- 

dividuals has for us a threefold meaning: 

(1) To obtain a smaller coefficient in the numerical sense. 

(2) A considerable diminution of measurement and calculation. 


(3) If the obtained coefficients are in the same ratio to each other as when 
calculated from a small number of cocoons, we shall have one more proof of the 
* [Even the coefficient 9°792 is probably explicable on the basis of differentiated Askoli races, either 


by inheritance or long continued environment. K. P.] 
t loc. cit. p. 116. 
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correctness and possibility of practical application of the C.R.L. to the species of , 
the mulberry silk-worm and therefore to species of any other animal. 
For this purpose we undertook a final verification of the formula and calculated 
the C.R.L. for the same species and for the same nine characters, but for any 
casually chosen ten cocoons, and obtained the following results : 


TABLE VII. } 








| Baghdad Askoli Chinese-golden | 
ee ee, = Pee SI 
Baghdad ... — 50°313 + °318 110°295 + °318 
Askoli en re 50°313 + °318 “505 + *380 37°731 + °318 
Chinese-golden ... | 110°295+°318 37°731 + ‘318 i \ 


The coincidence is striking. The coefficients preserved their ratios and more- 
over the C.R.L. for other groups of Askoli has received the numerical value which 
Pearson attaches to it—it is near to zero. We think that the latter fact verifies 
most conclusively that the formula for Pearson’s C.R.L., which is employed by 
craniologists, can also be used for the purpose of defining the likenesses and 


divergences of species. 


As a result of the analysis by aid of the formula for the C.R.L. we can draw 
the following conclusions : 


(1) The likeness is greatest between the Askoli and Chinese varieties; the 
least between the Baghdad and Chinese varieties; the Baghdad variety is nearer 
to the Askoli than to the Chinese-golden variety. 

(2) Taking arbitrarily the Chinese-golden variety as the standard one, one can 
say according to (1) that the Askoli occupies the second place after the Chinese 
species and the Baghdad species the third, i.e. the latter is the worst of the three 
species examined. 

(3) According to the preceding statements (1) and (2), or from the comparison 
between the Baghdad and Askoli varieties with the Chinese (the first and second 
ratios), it follows that the Baghdad variety diverges from the Chinese three times 
further (110:38) than the Askoli from the Chinese, which can be seen from the 
following numbers : 

Degree of 


C.R.L. Likeness 
1 A. ° , Ch. : 38 ; ‘ 1 ; 
2 B. ; . Ch. . . 110 ; ; 3°0 
(4) If one calculates the C. R. L.’s for all species of the mulberry silk-worm and 
places these in a certain order, one obtains a genealogical table which may solve 


ww 


indirectly the problem of the origin of species and show the degree of their relation- 
ships. For this purpose one may compose, as is done by craniologists, a table of 
relative degrees like the one proposed by Pearson for human races*. 


* Biometrika, Vol. xvut. p. 112 
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(5) Such a table of degrees of likeness furnishes a means of orientation for 
the origin of species from each other and provides aid in investigating it. If for 
instance the C.R.L. of two species is very small and these species have only one 
strongly divergent character (for instance, thinness of thread), the investigation of 
their origin ought to be carried out in one of the following three ways: Find whether 
they have one common ancestral form with a lesser thickness of thread, or whether 
there are intermediate varieties giving a gradual transition from one to the other 
which we do not yet know, or finally whether one sprang from the other by the 
process of mutation. 


For the solution of these problems it is necessary, as already mentioned, to find 
the C.R.L. for as many species as possible. This involves a large amount of 
measurement and calculation, and it is necessary to standardise the list of characters, 
their quality (which particular character) and the number of cocoons. 


(6) Practice suggests that cocoons of the first generation of crossed parents are 
better than the parent cocoons. The determination of the C.R.L. of these cocoons 
and of either parent species will show us the likenesses and differences between 
these cocoons and also to which parent species they are more closely related. If 
the difference is great, one ought to analyse in detail the cocoons of the first gene- 
ration, if it is not there is no need for this, since the cross-bred cocoons differ only 
slightly. 


(7) The C.R.L. gives us the means of finding the origin of an unknown species. 
If we discover cocoons of such a species and suppose that according to its appearance 
it resembles some Eastern species, for instance the Baghdad species or Buchora 
species, we only have to calculate the C.R.L. for the unknown species and for the 
supposed related species in order to find which species is most nearly related to it. 
This, in its turn, will enable us to judge its qualities. 


(8) Having arbitrarily chosen some species as the standard one and compared 
with it any other species, the quality of which we want to determine, we make use 
of the C.R.L., determining it for each of the species, and thus find the likeness and 
the divergence. After having thus compared a whole series of species with the 
standard one, using a standard number and quality of characters and a standard 
number of cocoons, we can compose a table of C.R.L.’s placed in ascending order 
of magnitude. Such a table makes it possible to find how great. a likeness or 
divergence of certain species may be, as already mentioned under paragraph (3) 
of this Summary. This possibility is the result of the fact that the numerical expres- 
sion, the C.R.L., loses its abstract character and obtains a concrete meaning, 
showing the degrees of similarity and difference. 

In conclusion, I may put on recérd the means (J/), the standard deviation (¢), 
and the total frequencies used in computing the above coefficients of racial likeness, 
which may be found useful for further investigations. 
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TABLE VIII. 


Unit of Baghdad | Askoli Chinese-golden 
OE re) — 
ment 
M | 5 } on | M o n M o 
| 
= -——| | se | 

Longitudinal diameter... mi. 39°146 | 2°546 | 250 | 32°180 2°561 | 250 26°126 2°238 
Transverse diameter wick oe 18'114 L268); .. 15-410 2°247 17°407 1°493 
Weight of complete cocoon mg 1°655 *430/ ,, "624 “106 "487 ‘160 
Weight of cocoon-envelope ne "322 620] ,, "264 "045 | ,, "175 “O39 
Weight of frison ae - $1 °932 21°511 ea 35844 20°885 _ 19°550 11°853 
Weight of thread re ss 253°427 | 50°294) ,, | 205°520) 46°900; ,, 187°655 | 36°935 
Weight of bassinet a = 31°785 | 13°923] ,, 16°398 8826 ,, 10°810 2°802 
Length of thread 7 mm. 757°576 | 158°120| ,, | 781°272|)159°210| ,, 607-261 , 125°197 
Thinness of thread «+. | mM./mg. "340 ‘O60 < "270 "053 310 "055 
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' (6) On the Quantitative Interrelations of the Dominance of 
the Ocular Characters ; : : ‘ : ‘ 101 
: On the Quantitative Interrelations of the Dominance of 
the Manual Characters : : . : ' : 108 
4. Is there any Truth in the Theory of Universal Laterality? . 121 
a) Onthe Relationships in Percentage Occurrences between 
the Dominance of Ocular and Manual Characters. 121 
(6b) On the Relationships in Intensity between the Domi- 
nance of Ocular and Manual Characters . : ‘ 127 J 
5. Is Laterality a Continuous Variate ? , : ‘ ; ‘ 140 
2 6. To what extent is the Correlation of Lateralities a Function of 
the absolute organic correlations ? : : ; : 143 
IV. Summary and Conclusions : - : : ; ‘ : : 146 
I. INTRODUCTORY. 
: IN a preliminary study of this subject* based on data collected by Sir Francis 


Galton, it was shown that the theory of “absolute laterality,” or the theory that 
sinistrality in one character means sinistrality in all, certainly did not apply 
to left-handedness as measured by grip and left-eyedness as measured by acuity 
of vision. Not only were the percentages extremely different for sinistrality of 
hand and eye, but there was a zero correlation between the difference of grip of 
‘ the right and left hands and the difference of acuities of the right and left eyes. 


* Biometrika, Vol. x1x. pp. 165—199. 
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The topic was there discussed on the basis of a very large number of measured 
individuals, 5000 to 7000, but only on the basis of the two above-mentioned 
characters. 

The present paper carries the matter further by dealing with a far larger 
number of both manual and ocular characters, but on a far more limited range of 
subjects, about 400. 


In this paper it seems unnecessary to repeat the history and criticism of the 
various theories of the subject, which appeared in the former memoir. We shall 
confine ourselves to an endeavour to answer the following questions. 

(1) For the population as a whole, for each character of hand and eye, which 
side is dominant and by how much? 

(2) Is the hand dominant for one manual character also dominant for a second ? 
Is manual laterality absolute ? 

(3) Is the eye dominant for one ocular character also dominant for a second ? 
Is ocular laterality absolute ? 

(4) Does the side which provides the dominant eye for one or all ocular 
characters provide the dominant hand for one or all manual characters? Is the 
theory of absolute laterality true for any or all characters? 

(5) Is either manual or ocular dominance in itself a clear-cut characteristic, 
i.e. can we always say that a particular individual is either dextral or sinistral for 
a given character? Is ambilaterality non-existent or even of very rare occurrence, 
or does it occupy its appropriate place between the extreme values of dextrality 
and sinistrality ? That is to say: Is laterality a continuous variate ? 

(6) To what extent is the degree of laterality, ie. the difference of the 
character on right and left sides, a function of the intensity of the character 
itself ? 

II. Source or Data AND METHODS oF TREATMENT. 


In order to throw light on the above questions, the data for six manual and 
five ocular characters which are measured on students tested in the Anthropometric 
Laboratory of the Department of Applied Statistics, University College, London for 
both right and left sides, have been placed at my disposal by the Director of 
the Laboratory. The numbers of individuals available, all males, vary round 400 
according to the character; a certain number of students not having presented 
themselves for all the tests. The ages run from about 18 to 24, but we have not 
thought it needful to make any correction for age for this range of years. It has 
been shown that the changes in ocular and manual characters for these ages are 
relatively small. The characters dealt with will now be specified: 


1. Ocular Characters. 


(1) Near Point. This test is measured in inches by the nearest distance at 
which a diamond type can be read clearly. The type is advanced towards the eye 


by steps of } inch and the last 4 inch interval from the eye at which the type can 
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be read is recorded. Clearly, the smaller the distance recorded, the better the 
subject’s eye can accommodate itself. If the right side record in Near Point is 
larger than the left, it means the left eye dominates the right one in this respect. 
(2) Visual Acuity. Instead of the Diamond Type used by Galton for Visual 
Acuity, the University College Anthropometric Laboratory uses Snellen’s test types 
with additional lines of type to make a finer gradation. The subject sits at the 
distance of six metres. The score is taken from the last line read completely plus 
number of letters read after it. The number of letters in each line being known 
the score is calculated to the nearest second place of decimals as follows : 
Scores go = 10, $,='17, $,=°25, § = 33, 6, =°50, § =°67 and so on 
Number of 


‘ : 2 3 4, 5 6 
letters in line . ‘ 


For instance, if line $; is read completely and plus five letters anywhere 
beyond it, the score is taken = ‘25 + 4 (33 —°25)4+1(50 —°33) = 36. Our measures 
are obtained from the differences of distant vision between the two eyes. 

(3) Spherical Refractions. (Refractions in the “horizontal”* meridian and 
in the “ vertical” meridian.) Refractions are tested by the ophthalmoscope at one 
metre distance from the eye. No cycloplegic is used, but the subject is asked to gaze 
at a dimly illuminated card at six metres distance. Lenses are introduced until 
there is no movement of the “shadow.” Two measures are taken for each eye 
for refractions in the “ horizontal” and in the “ vertical” meridian. To avoid the 
difficulty of scoring we have omitted those cases, very few in number, where right 
and left eye have refraction of opposite sign in the same meridian. The better 
eye is to be taken as the one having refraction least in amount in dioptres 
regardless of sign, the other eye is taken to be inferior by the difference in 
dioptres between the two amounts of refraction in a given meridian. 

(4) General Astigmatism. Amount of astigmatism is measured by the differ- 
ence in refraction to the nearest } dioptre between axes of maximum or minimum 
refraction, i.e. difference between refractions in the two meridians classed as 
horizontal and vertical. The better eye is to be taken as the one having astig- 
matism least in amount, regardless of sign, and the other eye is inferior by the 
difference between the two amounts of astigmatism. 

2. Manual Characters. 

(1) Grip. Strength of grip is measured in pounds for each hand by a grip 
dynamometer. Three trials with each hand are allowed, introducing a brief pause 
between each trial to avoid excessive fatigue. For our purpose only the highest 
record registered at each trial for each hand was used. 

(2) First Pull and Last Pull., Measured also in pounds by means of a 
dynamometer for the right and the left hands. The experimenter adjusts the 
instrument to height of subject’s hand and instructs him that he is, at the signal 


* Principal axis nearer the ‘ horizontal.” 
Biometrika xxA 6 
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“now,” to pull as forcibly as possible, and to maintain this pull with his utmost effort 
until told to stop at the end of 30 seconds. The experimenter, after saying “now” 
for subject to begin, immediately takes the first reading, and thereafter glances at 
the scale and records every 5 seconds; thus seven readings for each hand are taken 
altogether. For our present purposes only the first and last readings, described 
briefly as measures of first pull and last pull, are chosen. 


(3) Difference of Strength at First Pull and Last Pull, or Lack of Endurance in 
Pulling. It will be instructive to compare the initial strength with the final for 
each hand to obtain an index of persistency. For this purpose, we subtracted 
the last reading from the first and used the difference, which was taken in pounds, 
as a measure of the Lack of Persistency or Endurance. 


(4) Steadiness of Hand Test. The subject stands by a table on which are 
placed two parallel horizontal brass plates at a distance of about 4 inches one 
above the other. The upper plate is pierced with holes, twenty in number, and 
lettered A to JT. The subject is given a steel needle 1 mm. diameter wired to 
a battery, and holding it truly vertical, he has to pass it through the holes in the 
upper brass plate, touch the lower plate, when an electric bell rings, and extract 
it again without touching the wall of any hole; if he does touch, another bell of 
different tone is rung and the subject loses that hole. He starts with the hole A of 
‘2th inch diameter, and if he succeeds, he tries hole B of 39th inch diameter, and 
so on. The last hole the subject can pass the needle through ringing only the 
first and not the second bell is a measure of his steadiness of hand. Each subject 
is allowed two trials. The marking starts at one for A and increases by unity 
up to the hole M at 13 marks, and there for the very fine holes increasing by two 
marks from NV at 15 to 7’ at 27. The marking depends solely on the last hole 
reached without contact with upper plate. 


(5) Balancing Test. A flat piece of board is placed with one end slightly 
projecting over the edge of the table. Upon this board one or other of two 
vertical rods attached to discs of different small diameters can be placed. The 
subject has to take the board with right or left hand and raise it shoulder high 
with the arm outstretched and return it in the same way to the table without 
upsetting the rod. He makes three trials with each rod and with each hand, or 
12 tests in all. He gets three marks for each success with the rod on the larger 
base and five marks for each success with the rod on the smaller base. The 
subject can thus obtain 0 to 24 marks with each hand. To obtain a continuous 
distribution, subranges of three were taken. 


It should be noted that the Steadiness of Hand and Balancing Tests are not 
purely manual tests for they include some ocular control; they are really mixed 
tests, included under manual tests merely for convenience. 


For all the above characters the intensity of laterality was measured by the 
difference of the performance of the right side from the left (right minus left) ; 
if this be positive the subject is termed a dextralist for this character; if negative 
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a sinistralist. But in such cases as Near Point, the two Refractions and General 
Astigmatism, the signs of individual scores are reversed (or left minus right) 
because we consider that the better score should be less in amount. 

io . = m4 M , y . 5 

For simplicity we shall generally use the symbols, A,, A,,....A, to represent 
the differences of the six manual characters, B,, B., ... B; to represent those of the 
five ocular characters. 

Let us now turn to the data and see how far we are able to’some extent to 
answer the questions propounded above. 


III. PROBLEMS AND RESULTs. 


Dae . - . 
Problem 1. For the population as a whole, for each character of hand and eye, 
which side is dominant and by how much ? 


Table I gives the values of the means and standard deviations for each 
character of hand and eye based on the data (a) of the whole population, (b) of 
individuals showing dextrality, (c) of those showing sinistrality. From Table I 
it will be seen that, on the average, all ocular characters are a little superior 
in the right eye but that this superiority is insignificant except in the case of the 
two Refractions. For the manual characters the superiority of Grip is, in all cases, 
in favour of the right hand and is large, 4°53 lbs. This is approximately 1°3 lbs. 
higher than we obtained from Galton’s data, The higher value for this character, 
obtained in the present investigation, may be due to the fact that the data 
were taken from College Students so that no young children and old people are 
included. 


The steadiness and balancing tests are about 14 times better for the right 
hand than for the left. For the first pull and the difference between first and 
last pull the superiority of the right hand over the left is about $ lb. which is 
small if compared with the superiority of grip. In the case of the last pull, the 
left hand shows a slight superiority which, however, becomes insignificant when 
compared with its probable error. 

Let us now study the fourth and sixth columns of Table I and compare the 
constants of each side. It is obvious that for all ocular characters the difference 
of means between both sides is very small, occurring only in the decimal places of 
the given units, and that the difference of variation between each corresponding 
pair is also small, These results appear to agree with those we obtained from 
Galton’s data on Visual Acuity. The comparison of the right and left superiority 
(Columns 4 and 6) for three of the manual characters—namely the three pull 
tests—also shows that, although the values are not equal, there is only a slight 
difference between them. The right hand has, on the average, a superiority of 
2°6 lbs. for the grip test and 1 unit for the steadiness and balancing tests. Their 
variabilities are a little greater on the right than on the left. 

From all these results, it will be seen that, though most of the characters 
studied show a superiority of the right side the amount in more than half the 
cases is too small to be significant. 

6—2 
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: Therefore it is impossible to say that all the characters of hand and eye are 
always superior in one direction and that the amount of superiority, whatever its 
direction, is always the same proportion of the whole intensity (4) as van Biervliet * 


asserts. 


Problems 2 and 3. 


ocular character also dominant for the others ? 


ocular laterality absolute ? 


Is the hand or eye which is dominant for one manual or 
Ts manual laterality absolute? Is 


(a) On the relationships of percentage occurrences of the dominance of (1) Ocular, 


; (2) Manual characters. 


It has been suggested by those who hold the theory of absolute laterality 


both for hand and eye that these characters follow the simple Mendelian ratio 


with dextrality as the dominant and sinistrality as the recessive character. 


This 


(according to them) means that three quarters of the population show dextrality 
and a quarter sinistrality while there are no ambilaterists at all. We have found that 


Percentages of Ocular and 


a. Ocular L ater: ralitie 48 


Sinistralists 


Ocular characters 


| lst character 








| Near Point | 16-18-41-22 (67) 
Visual Acuity | 44-39 +1-64 (186) | 
tefraction in the | 23°86+1-45 (94) 


Horizontal Meridian 
Refraction in the 

| Vertical Meridian | 
General Astigmation 


| 21-7741-39 (86) | 1-35 
29-02 +1-50 (121) 


. Manual Lateralities 
Sinistralists 





| . 3 
| Manual characters ae 
Percentages a8 
| Bs 
| | m ; ; | 
| Grip | 15-69-+41:15 (72) | 1:00 
| | First Pull 37-94 +-1-53 (173) | 2-42 
Last Pull | 43-81 1-57 (198) | 2-79 
Lack of Endurance | 40-67 +1-56 (183) | 2-59 
| (First Pull —Last Pull)| 
|Steadiness of Hand | 21:13-1-29 (97) | 1:35 
| Balancing Test |? 29-98 +1-45 (137) | 1-91 





* «*T/asymétrie sensorielle,” 
pp. 326—366, Bruxelles, 1897. 











TABLE II. 


Ambilateralists 





i 1 538 
S23 2e 
a os | Percentages 238 
ass Ee 
et | on 
% | 
- $82 | 58-94+1-63 (244) | 1-00 
+ 19-39 | 16-47+1-22 (69) | 0-28 
1-14 | 29-44+41-55 (116) | 0-50 
| 
— 3-23 | 31-65+41-58 (125) | 0-54 
+ 4-02 | 46-28 0-79 


£ 1-65 (193) 


Ambilateralists 








Manual Lateralities for all Observations. 


| 
| Percentages 
a = 
24-88 +1-43 (103) | 1-00 
39-14+1-61 (164) | 1-57 
| 46- 7041: 70 (184) | 1-88 
1 | 
| 46-3 58 +1-68 (184) | 1-87 
| | 
24-70 +1-43 (103) | 0-99 


” De vxtralints 


| 
| 
| > ] be "i 
| .~o | 
= Se | | 
& | 2é | 
oe Percentage ja 3 | Percentages 3 
ee = | | 23 
=) > 2 
ow a e = = 
% | | 
— 931 5-66 -40-73 (26) | 1-00 | 78-65-+1-29 (361) | 1-69 
+ 12-94 | 9-65 +0-93 (44) | 1-70 | 52-41-+41-58 (239) | 0-67 
18-81 | 11-28+1-00 (51) | 1-99 | 44-91 -+.1-58 (203) | 0-57 
+1567 | 8-22+40-87 (37) | 1-45 | 51-11 £1-59 (230) | | 0-65 
» | | 
—~ 3-87 | 13-73-41-08 (63) | 2-43 | 65-14-+.1-50 (299) | 0-83 
+ 4:98 | 18-16+1-22 (83) | 3-21 | 51-86+1- ‘SS (287) | | 0-66 
Bulletins de VAcadémie royale de Belgique, 3™° série, Tom. xxxtv. 
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Galton’s data provided no confirmation of this hypothesis. Let us now see whether 
our present results support the theory of absolute laterality. Table II gives the 
percentage occurrence of each of the three main types of laterality for the 
characters studied in the total observations. It will be seen that the percentages 
of each type of laterality vary very considerably for different characters. 

If each column be examined separately it will be found that the percentages 
do not correspond to those required by a supposed theory of Mendelian occurrence*. 
Only one of the sinistral characters nearly approaches a 25 per cent. occurrence, 
having regard to its probable error—this is the refraction taken from the horizontal 
meridian which gives a value of 23°86 + 1°45 per cent. However, the dextralists 
for this character are only 46°70 + 1°70 per cent. and the value for ambilateralists 
being 29°44 + 1°55 per cent. shows them to be far from non-existent. None of 
the dextral percentages gives a result of 75 per cent. and all the figures for 
ambilateralists are significant. The smallest percentage of ambilateralists is for 
Grip, but although this is only 566 +0°73 per cent. it is definitely significant 
considering the value of the probable error. In order to compare all the characters 
used more clearly we have found the value of the deviation of the observed 
percentages from those required by the Mendelian ratio for both dextrality and 
sinistrality. There is a significant difference between all of them except for 
sinistrality in the case of horizontal refraction. 

The divergence of the percentages for the various characters of each group 
can be tested in another way. The percentages of Near Point and of Grip can be 
supposed to be units for ocular and manual characters respectively and ratios of 
the percentages of other characters to that of the corresponding unit are calculated 
and set forth in the above table. It will be obvious how largely they differ from 
one another in amount. For instance, in the case of Near Point and Visual Acuity 
there are 2°74 times more sinistralists and 1°57 times more dextralists for the 
latter character but there are only 0°28 times more ambilateralists. 

From the above results we may assert that neither manual nor ocular later- 
ality is absolute. In other words, it does not follow that because the right hand 
or eye is dominant for one character it is necessarily dominant for another even 
though the two characters may be closely related. 

We will now proceed to analyse our data more closely. So far we have only 
considered the laterality of each character, clubbing together all observations, and 
we may have confused the issues by only considering the total frequency of the 
laterality of one variate without studying the associate . distribution of laterality 
of the two variates combined. For this reason, the percentage occurrences in 
nine categories of lateralism for any two characters of the two groups are set 
forth in Table IIT a and 6. 


ocular group and fifteen for the manual one. 


It involves twenty-five pairs of characters, ten for the 
Each presents three kinds of 
percentages, namely, (1) those in round brackets are the actual percentages of 
the total population, (2) those in square brackets are percentages in the cells of 
* See American Breeders’ Magazine, Vol. 11, pp. 19 
imerican Naturalist, Vol. xuvi1, pp. 734—738. 


113; Journal of Genetics, Vol. 1v, pp. 67—81; 
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' TABLE ITT. 
Percentages Observed and Expected for each Category of (a) Ocular Lateralities, 
and (b) Manual Lateralities. 
(a) Ocular Lateralities. 
(1) Near Point and Visual Acuity. 
| Visual Acuity 
| Sinistrality | Ambilaterality Dextrality Total 
Near Point |— es aa an ee a pe 8 
| % of Totals | % of % of Totals | o% of |__ bedecaleseanl | % of | . 
? | Obs. | Exp. Column | Obs. Exp. | Column | Obs. | Exp. meet 
Sinistrality | 24 (5-80)| (6-88) | [13-64]| 9 (217)| (2-77) | [12-68]| 34 (8-21)| (6-53) | [20-36] | 67 (16-18) | 
Ambilateraiity | 103 (24-88) | (25-05) | [58-52] | 45 (10-87) | (10-12) | [63-38] | 96 (23-19) | (23-77) | [57-49] | 244 (58-94) 
i |Dextrality ~ | 49 (11-83) | (10-58) | [27-84] | 17 (4:11)| (4-26) | [23-94] | 37 (8-94) | (10-04) | [22-15] | 103 (24-88) 
1 | 
: = 2 
Totals 176 (42-51) 71 (17-15) | 167 (40-34) | 414 (100) | 
(2) Near Point and Refraction in “ Horizontal”? Meridian. 
Refraction in “* Horizontal’’ Meridian | 
| | 
: —_ PIES EEE ARE RES pe eer ee a ae 
Sinistrality | Ambilaterality Dextrality 
y Near Point | = : — = A sere eee Totals 
s % of Totals % of | % of Totals % of | __ % waltamenl = % of 
Obs. | Exp. Column Obs. Exp. Column Obs. Exp. Column | 
Sinistrality | 15 (387) | (3-43) | [15-96]| 10 (2-58)| (4-25) | [8-62]| 30 (7-73) | (6-50) | [16-85] | 55 (14-18) | 
Ambilaterality | 56 (14-42) | (14-73) | [59-57] | 76 (19-59) | (18-18) | [65-52] | 104 (26-81) | (27-91) | [58-43] | 236 (60-82) | 
| Dextrality 23 (5-93) | (6-06) | [24-47] | 30 (7-73) | (7-47) | [25-86] | 44 (11-34) | (11-47) | [24-72] | 97 (25-00) | 
| sina 
‘ | Totals | 94 (24-22) 116 (29-90) 178 (45-88) | 388.(100) | 





(3) Near Point and Refraction in “ Vertical” Meridian. 


Refraction in “ Vertical” Meridian 


Sinistrality 








| 
bilaterality Dextrality | | 
be Ambilaterality extrality | Totals 
| Near Point | === _ a = |- —— wee a 
| | % of — o% of % of Totals | o of | % of — o% of | ” 
| Obs. | Exp. Column Obs Exp Column | Obs Exp Column | | 
| Sinistrality 17 (4:37) | (3-33) | [21-25] | 13 (3-34) | (5:37) | [10-08] | 33 (8-48) | (7-49) | [18-33] | 63 (16-19) 
| Ambilaterality | 42 (10-79) | (12-32) | [52-50] | 87 (22-37) | (19-86) | [67-44] | 104 (26-74) | (27-72) | [57-78] | 233 (59-90) | 
|Dextrality ~ | 21 (5-41) | (4-92) | [26-25] | 29 (7-45) | (7-93) | [22-48] | 43 (11-05) | (11-06) | [23-89] | 93 (23-91) | 
| Totals | 80 (20:57) 129 (33-16) 180 (46-27) | 389 (100) | 
(4) Near Point and General Astigmatism. 
: General Astigmatism | | 
Sinistrality | Ambilaterality | Dextrality | 
, oe : : : Totals | 
Near Point | | ‘ Bee iS ee, ——| _ | 
| -- % of Totals _| %of | % - Totals o of | Lacabadanonci _| % of | | 
| Obs. Exp. |Column | Obs’ | Exp. Column Obs. | Exp. {Column | 
|Sinistrality | 26 (6-33) | (4-40) | [22-03] | 19 (462) | (7-20)| [9-84]| 18 (4-38) | (3-73) | [18-00] | 63 (15-33) | 
| Ambilaterality | 68 (16-54) | (17-12) | [57-63] | 117 (28-47) | (27-99) | [60-62] | 60 (14-60) | (14-50) | [60-00] | 245 (59-61) 
Dextrality | 24 (5:84)| (7-19) | [20-34] | 57 (13-87) | (11-7) | [29-54] | 22 (5-35) | (6-10) | [22-00] | 103 (25-06) | 
: | ‘ ae ee : . ce Ss SSS PS es eee 
Totals 118 (28-71) 193 (46-96). 100 (24-33) | 411 (100) | 
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| 

Visual 
Acuity 
| 


Sinistrality 
| Ambilaterality 
Dextrality 


| Totals 

— 

| Visual 
Acuity 


| Sinistrality 
Ambilaterality 
Dextrality 


| Totals 


Visual 
Acuity 


Sinistrality 
Ambilaterality 
Dextrality 


| Totals 


Refraction in 
“Horizontal” 
Meridian 


| Sinistrality 
| Ambilaterality | 
| De xtrality 


| Totals 


Dextrality and Sinistrality of Hand and Eye 


(5) Visual Acuity and Refraction in “ Horizontal” Meridian. 















































Refr action in “ Horizontal” Meridian 
Sinistrality | ~ Ambik ate ré ality ag Dextrality Totals 
rs a . (i ap 
% of Totals | o of | - % of Tot: il | o %, of | % of ' Totals of of /o 
Obs. | Exp. \Column| Ops. | Exp. \¢ a ie | Exp. Column 
aS Sek = SSE 
46 (11-71) | (9-79) | [50-55] | 53 (13-48) | (12-79) | [44:54] | 67 (17-05) | (19-66) | [36-61] | 166 (42-24) 
7 (1-78) | (4:00) |} [7-69] 33 (8-40) (5-24) | [27-73] | 28 (7-12) | (8-06) | [15-30] | 68 (17-30) 
38 (9-67) | (9-37) | [41-76] | 33 (8-40) | (12-25) | [27-73] (18-84) | [48-09] | 159 (40-46) 
91 (23-16) 119 (30-28) | 183 (46-56) 393 (100) ' 
(6) Visual Acuity and Refraction in “ Vertical’? Meridian. 
tefraction in “ Vertical” Meridian | 
Sinistrality Ambilaterality | Dextre ality Totals 
a ve = ay ‘ 
9 of Tots 8s | 9 ) Tote 8 | 9 rt ° 
b Cotal =e % of Total | 0 of | o of Totals _| 9% of 
Obs Exp Column Obs. Exp. ( Jolumn | Glia. Exp. Column | 
37 (9°37) | (9-21) | [44-05] | 66 (16-70) | (14-02) | [51-56] | 68 (17-22) | (20-06) | [37-16] | 171 (43-29) 
17 (4:30) | (3-71) | [20-24] | 29 (7-35)| (5-67) | [22-66] | 23 (5-82)| (8-09) | [12-57]| 69 (17-47) 
30 (7-59) | (8-34) | [35-71] | 33 (8-36) | (12-72) | [25-78] | 92 (23-29) (18-18) | [50-27] 155 (39-24) | 
84 (21-26) | 128 (32-41) | 183 (46-33) 395 (100) | 
(7) Visual Acuity and General Astigmatism. 
- _ i — —— - — ———— ’ 
Gener. ral Asti; gmatism 
Sinistrality | Ambik: ater ré ality, | Dextr: ality Totals 
— —| - seed am 0/ 
% of Totals | | Q s Y of Tote / 
% of Total: err L of Tot als i % of | % of Totals | % of 0 
Obs. Exp. Column Oke. | Exp. \¢ Jolumn | Obs. Exp. Column 
57 (13-68) | (12-53) | [47-11] 76 (18-22) | (19+ 98) | [39- 38] 47 (11-27 7) | (10-66) | (45-63) | 180 30 (43-17) 
17 (4:07) | (4:94) | [14:05] | 39 (9-35) | (7-87) | [20-21] 15 (3-60) | (4: 21) | [14 a3} | 1 ha (17-02) 
47 (11-27) | (11-55) | [38-84] | 78 (18-71) | (18-43) | [40-41] | 41 (9-83 (9-83) | [39-81] | 16 6 (39:81) 
121 (29-02) | 193 (46-28) 103 (24-70) | 417 (100) 





(8) Refractions in “ Horizontal” and “ Vertical’ Meridians. 


Refrac tion j in ‘ “Vertical” Meridian 














Sinistrality Ambilater: ality Dextrality Totals 
a | a 
o& of Totals | oY o ‘0 s wy) /0 
a % of Total 9% of |. 9 of T tals _| 9% of % ot Totals | 0% of 
Obs Exp Column Obs Exp Column | Obs | Ex Column 
Se | 4 . | Se 42 ° | | Se 2 . 
47 (12-27) | (4-74) | [59-49] | 18 (4-70) | (7-62) | [14-18]| 23 (6-01) | (10-62) [12-99] 88 (22-08) 
14 (3-66) | (5-93) | [17-72] | 76 (19-84) | (9-52) | [59-84] | 20 (5-22) | (13-27) | [11-30] | 110 (28-72) | 
18 (470) | (9-96) | [22-79] | 33 (8-62) | (16-02) | [25-98] | 134 (34-98) | (22-32) | [75-71] 185 (48-30) | 





79 (20-63) | 127 (33-16) | 177 (46-21) ~ | 383 (100) 














7°30) 
0-46) 


00) 











2-98) 
8-72) | 
8-30) | 


00) 
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(9) Refraction in “Horizontal” Meridian and General Astigmatism. 





| General Astigmatism 











Refraction in " Sinistr alit y | 





























































| 
| saints 
| Ambilate srality | Dextrality Totals 
| “Horizontal” = 6 : ———— (ee a . 
| Meridia % of Totais = of ‘ ) ‘otals % 
| eridian |__%e 7 ot als | % of z , 0 Tot ide a % of \ = asic o% of 
| Obs. | Exp. \¢ eens “Obs. | :E xp. ( tae, Obs. | Exp. g = h 
| -_ a | 
Sinistrality | 46 (11-68) | (6-73) | [41-44] | 27 (6-8: 5) | (11-26) | [14-51] | 21 (5-33) | (5-87) | [21-65] | 94 (23-86) | 
| Ambilaterality | 28 (7-10) | (8:29) | [25-23] | 7 (19-55) | (13-90) | [41-40] | 11 (2 79) | (7-25) | [11-34] | 116 (29-44) 
Dextrality 37 (9.39) (13-15) | [33-33] | 82 (20-81) | (22 05) | Lt 09] | 65 (16-50) | (11-50) | [67-01] | 184 rr 70) 
| Totals 111 (28:17) | 186 (47-2 | 97 (24 62) 394 (100) 
(10) Refraction in “Vertical” Meridian and General Astigmatism. 
| General Astigmatism | 
| | = “ es eee a ania a 
Re fraction in | Sinistrality | Ambiintecality | De »xtrality | — 
: ’ | Totals 
| * ‘Vertical” . : = = —|- — ~ 
Meridian | oe % of ses | 0% of % of Totals 0% of | % of Totals _| % of 0 
| Obs. | Exp. i‘ Jolumn | Gin. | Exp. i Jolumn | Obs. | Exp. |‘ tolumn | 
Sinistrality | 31 (7-86)| (6-07) | [28-18]| 28 (7-08) | | (10-47) rev] | | 27 (683) | (5-2 3) | [28-42] | 86 (21-77) 
Ambilaterality | 19 (4-81) | (8-81) | [17-27] | 81 (20-51) | (15-22) | [42-63] | $5 (6-33) | (7-62) | [26-32] | 125 (31-65 
Dextrality 60 (15-18) | (12-97) | [54-55] | 81 (20-51) | (22-41) | [42-63] 3 (10-89) | (11-20) | [45-26] | 184 (46-58) 
2 | | = — 
Totals | 110 (27-85) | 190 (48-10) i 95 (24-05) 395 (100) | 
(b) Manual Lateralities. 
(1) Grip and First Pull. 
] | First Pull is 
is < 7‘ 5 - | 
| Sinistrality Ambilaterality | Des extrality Totals 
Grip | : = ota s 
9 ‘otals ol % of Totals % of Totals > 
| ; 4 of Total | % of | , of Total ie ” of | , 0 Tota o% of | C 
| Obs | Exp Column Obs Exp | Column |} Obs | Exp. \¢ Yolumn | 
Ss. Xp. Ss. Xp. 3. 
Sinistrality | 36 (7-93) | (5-93) | [20-93] 7 (1-54) | (1:51) | [15-91] | 28 (6-17) | (8-20) ne 77) | : oe (1: 5-64) 
| Ambilaterality | 10 (2-21)| (2-17)} [5-81] 1 (0-22) | (0-56) | [02-27]| 15 (3-30) | (3-00) | [6-30]| 26 (5-73) 
| Dextrality 126 (27-75) | (29-79) | [73-26] | 36 (7-93) | (7-62) | [81-82] | 195 (42-95) | (41-22) | [81-93] | 357 (78-63) | 
| - | a | — - | ——— 
| Totals | 172 (37-89) 44 (9-69) 238 (52-42 | 454 (100) | 
(2) Grip and Last Pull. 
; Last Pull” 
| | Sinistrality. | Ambilaterality | Dextrality " 
= a e : __ j = Totals 
| Grip a vr 
y= | ee 0/ of Tatale l 
- % of Totals | o% of | % of Totals | 9% of | % of Totals | © of | o 
| Obs. | Exp. {Column | Obs. | Exp. perl Obs. | Exp. |‘ Jolumn | 
z ; =e = ae 
| | 
Sinistrality | (6-91) | [18-78] | 5 (1-11) | (1-75) | [10-00]| 29 (6-44) )| (7-12) | [14-29] | 71 (15-78) 
| Ambilaterality | (2-43) | [8-12] | b. (0-00) | (0-62) | [0-00]| 9 (2-00)| (2-50)| [4:43]| 25 (5-55) 
|Dextrality | B44) | [73-10] | 45 (10-00) | (8-74) | [90-00] | 165 ( ‘6. 67) | (35-49) | [81-28] | 354 (78-67) 
Totals 197 (43-78) 50 (11-11) 203 (45-11) 450 (100) 
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(3) ¢ 


Sinistrality 





Grip 
% of Totals 
| | Obs Exp. 
Sinistralits | 32 (7-16) (6-35) 
Ambilaterality | 8 (1-79) | (2-30) 
Dextrality 144 (32-21) | (32-51) 
| Totals 184 (41-16) 
| | 
| 
| Sinistrality 
Grip 
| % of Totals 
| | Obs. Exp. 
| Sinistrality 19 (4:14) | (3-31) 
Ambilaterality 5 (1-09) (1-20) 
Dextrality 73 (15-90) | (16-62) 
| Totals | 97 (21-13) 
! - 


Sinistrality 


Grip 
% of Totals 


Obs Exp 
| Sinistrality 29 (6-39) | (4-68) 
Ambilaterality 6 (1:33) | (1-60) 
Dextrality | 97 (21-36) | (22-80) 


| Te tals 132 (29-08) 


Sinistrality 
| First Pull 
| % of Totals 


Obs. Exp. 
Sinistrality 94 (20-80) | (16-57) 
Ambilaterality 23 (5:09 (4:37) 
Dextrality 81 (17-92) | (22-87) 
| Totals 198 (43-81) 


trip and Lack of Endurance in Pulling (First Pull 


zack of Endurance in Pulling 
Lack of E 


Ambilaterality 


o% of | % of Totals % of 

( weet Cin: Exp. |Column 
| ™ x 

| 117-39) | 6 (1-34) | (1-28) | [16-22 

[4°35] 0 (0-00) | (0-46) | [00-00] 

[78-26] | 31 (6-94)| (6-54) | [83-78] 


37 (8-28) 





Dextrality and Sinistrality of Hand and Eye 


Last Pull). 








| 
| | 
Dextrality . | Totals | 

| | a} 

0 FT otale | /O 

6 of Totals o% of | 
Obs Exp i 
31 (6-94) | (7-81) | [13-72] | 69 (15-44) 
17 (3-80) | (2-83) | [7-52]| 25 (5-59) 
178 (39-82) | (39-92) | | (78-76) | 


353 (78-97) | 
| | 


226 (50-56) | 447 (100) 


(4) Grip and Steadiness of Hand Test. 


Steadiness of Hand Test 


Ambilaterality 


Y, of % of Totals ”, of 
( A aol Obs | Exp. ( R: shell 
[19-59] 7 (1-53) | (2-15) | [11-11] 
[5-15] 3 (0-65) | (0-78) | [4-76] 
[75-26] | 53 (11-55) | (10-80) | [84-13] 
63 (13-73) 
(5) Grip and Balancing Test. 
salancing Test 
Ambilaterality 
o% of 6 of Totals o% of 
Column Obs. Exp. Column 
[21-97] 13 (2-86) | (2-94) | [15-66] 
[4-55] 5 (1-10)} (L1-01)| [6-02] 
(73-48) | 65 (14-32) | (14-33) | [78-32] 
83 (18-28) 
(6) First Pull and Last Pull. 
Last Pull 
Ambilaterality 
o of fy of Totals o% of 
Column Obs. Exp. Column 
[47-47] | 18 (3-98)] (4-27) | [35-29] 
[11-62] 5 (1-11) (1-12) | [9-81] 
[40-91] | 28 (6-19) | (5-89) | [54-90] 


51 (11-28) 


— | 
Dextrality Totals | 
7- l | 0 
% of Totals o% of | ) 
Oba, Exp. Column | 
46 (10-02) | (10-23) | [15°38] | 72 (15-69) 
18 (3-92) | (3-68) | [6-02] | 26 (5-66) 
| 235 (51-20) | (51-23) | [78-60] | 361 (78-65) 
299 (65-14) | 459 (100) | 
Je > ; ry 
Dextrality | Totals 
0 


| % 


q a) of Totals | ° , of 

Oba. Exp. | ( olumn | 
31 (6-83) | (8-46) | [12-97] | 73 (16-08) 
14 (3-08 (2-90) | [5-86] |} 25 (5-51) 

194 (42-73) | (41-28) | [81-17] | 356 (78-41) 
239 (52-64) | 454 (100) 
Dextrality 

aaaeaciy Totals 
% of Totals % | 
6 0 otals | L of / 

Obs. Exp. pee | 
59 (13-05) | (16-99) | [29-07] | 171 (37-83) | 
17 (3-76)| (4:47)| [837]] 45 (9-96) 

127 (28-10) | (23-45) (62-56) | 236 


(52- 21) 


203 (44-91) 


| 452 (100) 








als =| 


6-08) 
5-51) 
8-41) 


00) 





First Pull 


| Sinistrality 
Ambilaterality 
| Dextrality 


Totals 


| 
| First Pull 


Sinistrality 
Ambilaterality 
Dextrality 


Totals 


First Pull 


Sinistrality 
Ambilaterality 
Dextrality 


Totals 


Last Pull 


Sinistrality 


Ambilaterality 


| Dextrality 


| Totals 
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~~ 
~I 


) First Pull and Lack of Endurance in Pulling. 





bed k of: Endurano e in Pulling 
Sinistrality | Dextrality 
| 
| | =e ‘ 
% of Totals | % of Totals | % of Totals 
2 % of | 3 i % of | 2 
| Column | 


Ambilaterality | 


Obs. | Exp. Obs. Exp. \Column) Obs. Exp. 
113 (25-11) | (15-36) a 5]| 14 (3-11) | (3-11) | [37-84] | 43 (9-56) | (19-31) | [ 
18 (4-01) | (3-98)| [9-84]| 5 (1-11)| (0-80) | [13-51]| 21 (4-66) | (5-00) 
52 (11-55) | (21-38) | [28-41] | 18 (4-00) | (4-31) | [48-65] | 166 (36-89) | (26-80) || 


183 (40-67) 230 (51-11) 


(8) First Pull and Steadiness of Hand Test. 
Steadiness of Hand Test 


Sinistrality Ambilaterality Dextrality 


% of Totals % of Totals % of Totals 


o/ 
tof 


Column 


% of 


Column 


Obs. Exp. Obs. Exp. Obs Exp 
41 (8-99) | (8-07) | [42-27] | 26 (5-70) | (5-16) | [41-94] | 106 (23-25) | (24-71) 
10 (2-19) | (2-06) | [10-31] 2 (0-44)] (1-31)| [3-22]} 32 (7-02)] (6-2 
46 (10-09) | (11-14) | [47-42] | 34 (7-46) | (7-13) | [54-84] | 159 (34-86) | (34-14) 


97 (21-27) 62 (13-60) 


(9) First Pull and Balancing Test. 


Balancing Test 


Sinistrality Ambilaterality Dextrality 


% of Totals | % of Totals | oy 


o a 0 
6 of é, of 


Column 


of Totals 


}Column 


o 


Oo; 
o 


6 of 


of 


Column 


Oo 
Column 


[35-69] 
[10-77] 
[53-54] 


Column 


| 
| 


| 
| 


Totals 
% 


== | 


| 170 (37-78) | 


| 450 (100) | 


44 (9-78) | 
236 (52-44) | 


173 (37-94) | 
44 (9-65) 
239 (52-41) 


456 (100) 


Obs. Exp. Obs. Exp. Obs. | Exp. 
58 (12-86) | (11-35) | [42-96] | 28 (6-21)| (6-89) | [34-15] | 85 (18-85) | (19-68) | [36-33] 
13 (2-88) | (2-92)| [9-63] 9 (2-00) | (1:77) | [10-98] | 22 (4-87)| (5-06) | [9-40] 
64 (14-19) | (15-66) | [47-41] | 45 (9-97) | (9-52) | [54-87] | 127 (28-17) | (27-15) | [54-27] 
135 (29-93) | 82 (18-18) 234 (51-89) 
(10) Last Pull and Lack of Endurance in Pulling. 
Lack of Endurance in Pulling 
Sinistrality Ambilaterality cesses 
| % of Totals | 0% of % of Totals of of % of —— of of 
Obs | Exp Column Obs. Exp. Column | Ola. Exp. ~|Column | 
| } 4 ——— 
37 (8-23) | (17-82) | [20-11] | 14 (3-11) | (3-58) | [37-84] | 145 (32-22) | (22-16) | [63-32] | 
18 (4:00)| (4:54)| [9-78]| 5 (1-11)}| (O91) | [13-51] |} 27 (6-00) | (5-66) | [11-79] 
| 129 (28-66) | (18-53) | [70-11] | 18 (4-00) | (3-73) | [48-65] | 57 (12-67) (23-07) | [24-89] | 
| | 
184 (40-89) 37 (8-22) 229 (50: 89) 


Totals 
o 
0 





| 
° 
| o 
| 
| 


196 (43-56) 
50 (11-11) 
204 (45-33) 


= | 450 (100) 














92 Dextrality and Sinistrality of Hand and Eye 


(11) Last Pull and Steadiness of Hand Test. 


| Sinistrality 
Last Pull - 
| %, of Totals 





| % of | % of 

Obs. | Exp. Column | Obs. | Exp. {Column 
| Sinistrality 48 (10-64) | (9-25) | [50-53] | 29 (6-43) | (6-04) | [46-77] | 
| Ambilaterality 9 (2-00) | (2-33)| [9-47] $ (0-89) | (1-52)| [6-46] | 
Dextrality 38 (8-42) | (9-48) | [40-00] | 29 (6-43)! (6-19) | [46-77] 


Totals 95 (21-06) 
on = = 
| Sinistrality 
| Last Pull 
| % of Totals 
Obs Exp ( 

Sinistrality GL (13-74) | (13-19) | [ 
Ambilaterality 10 (2-25) | (3-40) 

| Dextrality 63 (14-19) | (13-59) | | 


| Totals 134 (30-18) 


Steadiness of Hand Test 


Ambilaterality 


% of Totals 


| 
| 
| 
| 
| 





62 (13-75) 


(12) Last Pull and Balancing Test. 


Balancing Test 
Ambilaterality 


% of Totals 


% of } % « of 
‘olumn tn. Exp. Column 
45-52] | 39 (8°78) | (7-87) | [48-75] 
[7-46] 12 (2-70) | (2-03) | [15-00] 
47-02] | 29 (6-54) | (8-12) | [36-25] 


80 (18-02) 





Dextrality 
7 ey _ Totals 
% of Totals | % | 
= % of | 
Obs. | Exp. eet | 





121 (26-83) | (: rs $0] [ 
37 (8-20) | [12 
136 (30-16) | (: $9. 34) | [ 


| 198 (4 3-90) | 
50 (11-09) 
203 (45-01) | 


294 (65-19) | 451 (100) 


| 

| 

| 

Jextrality 

Dextrality | Totals | 
an | 9 | 
% of Totals o% of | o | 
Obs. | Exp. Column | | 
94 (21-17) | (22-63) | [40-87] | 194 (43-69) 
28 (6-31) | (5-83) | [12-17] | 50 (11-26) | 
LO8 (24°32) | (23-34) | [46-96] | 200 (45-05) | 


444 (100) | 


(13) Steadiness of Hand and Lack of Endurance in Pulling. 


Sinistrality 


Lack of Endurance in Pulling 


Ambilaterality 


Dextrality | 


% of Totals 7 o | 
% of | 
Obs. Exp. ( olumn | | 

| 
£1 (9-11) | (10-49) | [18-14] | 94 (20-89) | 
30 (6-67) | (6°70) | [13-27] | 60 (13-33) 
155 (34-44) | (33-03) | [68-59] | 296 (65-78) 


226 (50-22) 450 (100) 
= ee I 


Steadiness 
of Hand % of Totals % of Totals : 
%, of %, of 
, Obs. Exp. Column Obs. Exp. Column 
Sinistrality 43 (9-56) | (8-68) | [22-99] 10 (2-22) | (1-72) | [27-03] 
Ambilaterality | 26 (5-77) | (5-54) | [13-91] £ (0-89) |} (1-09) | [10-81] 
Dextrality 118 (26-23) | (27-34) | [63-10] | 23 (5-11) | (5-41) | [62-16] 
| Tot ils 187 (41-56) 37 (8-22) 
(14) Balancing Test and Lack of Endurance in Pulling. 
“Tac k of E ndurt ance in Pulling 
Sinistrality | Ambilaterality 
vlancing 
| Test % of Totals Se % of Totals 
| % of %, of 
Obs. Exp. Column Obs. Exp. Column 
Sinistrality 60 (13-51) | (12-09) | [3297]| 9 (2-04) | (2-46) | [24-33] 
Ambilaterality | 31 (6-98) | (7-66) | [17-03] 5 (1-12)| (1-56) | [13-51] 
| Dextrality 91 (20-50) | (21-24) | [50-00] | 23 (5-17) | (4-31) | [62-16] 


Totals | 


182 (40-99) 





Dextrality } 

extrality | Tot ls | 

% of Totals o% of Yo 

Obs. Exp. Column | 

62 (13-96) | (14-96) | [27-55] | 131 ( 29-1 51) | 

47 (10-59) | (9-47) | [20-89] | $3 {18-69} | 
116 (26-13) | (26-25) | [51-56] | 230 (51-80) 


225 (50- ) 


| 444 (100) | 
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(15) Steadiness of Hand and Balancing Test. 























| Balancing Test 
tals } Sinistrality ~ Ambilater ality Dextrality 
%, Steadiness |—— ; 7 i Se alae : —— — i Totals 
| of Hand % of Totals o of % of Totals D2 ) of % of Totals of /o 
—_— —" = /0 —_ —— - Yo at 
| Obs. Exp. Column Obs | Exp. A cone mn ——. _-Exp. ote, 
| == —— = : |—_——_———_} —|—— ———— 
43-90) |Sinistrality | 21 (4-60)| (6- 30) | [15- 33] | 19 (4-16) | (3-82) [22-89] | 56 (12-25) | (10-89) | [23-63] | 96 (21-01) 
45-01) | | Ambilaterality | 4 (5-25 (4-07) | (17-52] | 12 (2-62) | (2-46) | [14-46] | 26 (5-70) | (7-04) | [10-97] | 62 (13-57) 
|Dextrality | 92 (20-13) | (19: 61) | [67-15] | 52 (11-38) | (11-88) | (62: 63) 155 (33-91) | (33-93) | [65-40] | 299 (65-42) 
5 se j= — —— _ jas ee 
100) | . eo 
100) | ‘Tot als | 137 (29 98) | 83 (18°16) | 237 (51-86) 457 (100) | 


the numbers at the foot of the column, and (3) those under the columns headed 
—— “expected” give the percentages which might be anticipated if there were no 
relation whatever between any two characters. These pairs of values may be tested 
by such a criterion as this: If little relation exists for the given traits, the per- 
centages of laterality of one character for each type of laterality of another should 
be sensibly the same as in the corresponding totals column, and also the differ- 
; ence in each category between the observed percentage and the expected should 
1128. be very slight. On the contrary in case a high positive relation exists between 
the two given traits, the observed value in the categories of the same laterality (or 
100) | 3 column percentage in square brackets) will be larger than the expected one (or 
= marginal percentage). Those in the categories of opposite laterality will be much 
less than the corresponding independent values, and the higher the correlation, 
— the larger the difference. If the correlation be negative and high, the difference 
between the percentages should be in the reverse order. By the criterion just men- 
tioned we may test each pair of the percentages in the above table. A quantitative 
6 relation between any two characters may be roughly predicted from the knowledge 
4 of the nature and the degree of those differences. From our material it will be 
_ found that in the Manual Group there are two cases, the dominance of First 
3.33) Pull (7) and Last Pull (10) with that of Lack of Endurance, where there are high 
35-78) | correlations but with opposite signs; and in another two, the dominance of First 
00) | Pull with dominance of Grip (1) and of First Pull with Last Pull (6), with moderate 
on but significant correlations. Similarly in the Visual Group we have one case, 
the dominance of the two Refractions, with high positive correlation, and two 
cases, the dominance of Visual Acuity with that of the two Refractions, with 
| moderate correlation. Besides these, the differences between the percentages for 
any other pair of characters are either not so appreciable or are definitely in- 
significant. These results show a good degree of agreement with the quanti- 
tative values of association which will be discussed later. 
: For the purpose of throwing further light on this material we have classified 
951) the actual observed percentages of all characters of the two groups into three 
1-80) | main classes which will be described as foliows: 
00) | (1) Same laterality involving individuals of same dextrality, same sinistrality 


and same ambilaterality. 
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(2) Opposite laterality involving those who are dextral in one variate but 
sinistral in the other of the same group. 

(3) Mixed laterality involving those who are dextral or sinistral in one variate 
but ambilateral in the other. 


Table IV provides the classified percentages in each type of laterality for all 
the characters of both groups. The mean percentages and standard deviations 
are also set down at the foot of each column respectively. When this table has 
been carefully examined the reader will conclude that the mean percentage of the 
same laterality in the case of both hand and eye is always the largest but 
that its value, which is less than 50 per cent. of the whole population, cannot be 
said to form a law of that population. Even in such a highly related pair of two 
traits as the two Refractions, the value of the same laterality still does not exceed 
70 per cent. of the whole. Another high value of the same laterality is found 
in the manual group in the case of First Pull and Lack of Endurance in pulling. 
But still there are 38 per cent. of individuals who do not follow the supposed 
rule. The mean percentage of opposite laterality for each group is comparatively 
low in its amount but the value for each group (approximately one-third in 
the manual group and one-sixth in the visual group) is amply sufficient to prove 
the fallibility of the proposed rule. Also if the numbers for mixed laterality 
be added we find that the individuals with the same laterality are actually in a 
minority. 


Further, if we consider the percentages of like laterality in any two characters 
in each column, it will still be found that none of the individual values for the 
common dextrality or for the common sinistrality is identical with the Mendelian 
ratio with the single exception of the common sinistrality for First Pull and Lack 
of Endurance (25°11 per cent.); and this is of no significance because Mendelian 
theory would demand 75 per cent. of the corresponding dextrality. 

The numbers of common ambilateralists for two characters are really very few, 
especially in the manual group, but the reader must remember that this does not 
mean that all the ambilateralists for any one character are non-existent. Take an 
extreme case, that of Grip and Last Pull for illustration. The number of common 
ambilateralists is actually zero but if the value of laterality of each character be 
studied separately we should have 11°11 per cent. of ambilateralists for Last Pull 
and 5°55 per cent. for Grip (see Table IIIb (2)). The same is true in the case 
of Grip and Lack of Endurance. Among the mixed lateralities we find that 
the cases of ambilaterality combined with dextrality are significantly higher than 
those combined with sinistrality particularly in the visual group. 

The standard deviations for each type of laterality in both groups are all 
significantly large, ie. there is a great variation of percentages from sample to 
sample against the rule of absolute superiority. 


From the consideration of chese percentage occurrences we realise thoroughly 
that in each category of lateralism the relation of any two variates, either of the 
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manual group or the ocular group, is very largely independent and that the 
degree of laterality can in no way be treated as absolutely fixed for all characters. 

(b) On the Quantitative Interrelations of the Dominance of the Ocular Characters. 

We have thus far investigated the relationships of laterality in various manual 
and ocular characters by comparing the percentage occurrences of the observed 
value with those of the independent values for each type of laterality. We will 
now consider their quantitative associations by deducing the correlation values, 
i.e. those for r and ». If the theory of absolute laterality has any truth in it we 
should expect to find that one difference in visual character was closely associated 
with all the other differences in visual characters studied. The same should also 
be true for the manual characters. Let us first investigate the interrelations of 
the differences of the visual characters. 

(1) Difference of Near Point (B,) with those of the other four visual characters 
(B., B;, B, and B;). 

The values of the product moment coefficients for the dominance of Near Point 
and that of the other characters are stated below. The number of subjects tested 
is given in brackets. 

Dominance of Near Point and Visual Acuity: rg,,,=*1122 + ‘0327 (414). 

Dominance of Near Point and Horizontal Refraction: BBs = "1028 + °0539 (388). 

Dominance of Near Point and Vertical Refraction : rg, , = °0069 + ‘0342 (389). 

Dominance of Near Point and General Astigmatism: 7, 2, =°0206 + °0333(411). 

It will be seen that all these coefficients have a positive value but that they 
are of very low intensity. The only one of the above coefficients which may 
probably be regarded as significant is 7,, z, correlating the lateral difference of Near 
Point and Visual Acuity—this has a value 3°5 times its probable error. It means 
that an increase in the dominance of the Near Point, i.e. of near vision, is associated 
to a very small extent with increase in the dominance of distant vision. The 
coefficient measuring the relation of Near Point and Horizontal Refraction is just 
three times its probable error which probably indicates a slight relation. In 
other words, the eye dominant for near vision will be very slightly dominant for 
horizontal refraction. 


The correlation ratios are as follows* : 


B, on B,: 7? =°039241, 7 = 028986 + -007856, 
in on B,: .=°028034, 7 =°012077 + 005115, 
(B, on B,: 4? = "030875, 7 = °018041 + 006437, 
\B, on B,: 2 = 029833, 7 = 012887 + 005455, 
(B, on By: 4? =°034814, 7 = -020566 + -006854, 
|B, on By: y? =°010406, 7 = -012853 + -005440, 


(B, on B,: *=°029790, 7 =°014599 + -005636, 
1B, on By: 4? =°017959, 7 = -012165 + -005152. 


* 92 = (x — 1)/N + °67449 27? (1 — 9)/(N +1). 
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In seven out of the eight cases the value of y° is insignificant when compared 
with the corresponding value of 7. In the fourth example shown the difference 
between »? and # is about 3:1 times the probable error, thus lying just on the 
border line of significance. 

Since the above results show no correlations of lateralities of any importance 
it has not been thought necessary to give correlation tables or diagrams. 

(2) Difference of Visual Acuity with those of the two Refractions and of 
General Astigmatism. 

Next, we shall consider the influence, if any, of dominance of Visual Acuity on 
the dominance of both the Refractions and General Astigmatism. The correlation 
constants deduced from the figures of Tables V and VI are given below. 

Product moment correlation coefficients 
'Ba pq = + 2056 + 0326 (393), 
Correlation ratios 
(B, on By: 9» ='2487, 7?='061832, 7° =-020356 
|B, on B,: 1 ='2331, 4? =054325, 7 = 030534 
I’ pa By = + 2062 + 0325 (395), 
(B, on B,: 1 ='2396, »*='057399, 7 = 020253 + 006751, 
1B, on B,: 1 ='2847, »*="'055094, 7 =-030380 + 008226. 


+ ‘006786, 
4 


- 008268, 


Clearly the values of 7,,,, and rz,,, show the same order of intensity and are 
considerable enough to be significant. This suggests that improvements of the 
refractions in dioptres are to some extent associated with increase of dominance 
of Visual Acuity. We have also four correlation ratios of approximately the same 
value, a little higher than 7, and certainly significant. 

Diagrams I and IT show the two pairs of regression lines, from which we can 
see that the regressions of B, and B, on B, are nearly linear and can, accordingly 
be fitted with linear equations. 

(a) Probable B, =*72467B, + 19974. 

(b) Probable B,=°74429B, + 15740. 

The regressions of B, on B, and B, are not so simple, they show some degree 
of curvature and have been fitted with cubic equations. 

(c) Probable B, =— 012216 + 0753281 B, — 00963851 Be — 000372511 BY. 

(d) Probable B, =—-003739 + -0506592 B, — 00904642 8? + 0005074568. 

The value of zero in B, or B, corresponds to — ‘01222 of B, in equation (c) and 
to — 00374 of B, in equation (d) which differ very little in amount. In other 
words, the chance of the dominance of Visual Acuity gives a somewhat similar 
chance of the dominance of two kinds of Refractions. 

By the usual methods the correlation constants for the dominance of Visual 
Acuity and General Astigmatism are as follows: 


B, on B,: °="018334, 7 ="014388 + 005556, 
. — R19 of) 2¢ rd { 2 5 
6 ited cael eal (B, on B,: 9? ="008572, HF = "028777 + 007799. 
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Difference of Visual Acuity measured by Snellen’s Type reduced to decimals. (Central Values.) 
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TABLE V. 


Laterality of Visual Acuity and Refraction in the Horizontal Meridian. 


Difference of Refraction in the Horizontal Meridian in Dioptres, (Central Values.) Right minus Left. ' 
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Laterality of Visual Acuity and Refraction in the Vertical Meridian. 
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TABLE VI. 


Difference of Refraction in the Vertical Meridian in Dioptres. (Central Values.) Right minus Left. 
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LATERALITY OF VISUAL ACUITY & REFRACTION IN THE HORIZONTAL MERIDIAN 
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°o--o » ’ ' Visual Acuity on Refraction 


’ 
> 
} 


' 

a 

J 
I 





Difference of Refraction ia Diogtres (Right minus Left) 
I 
| 


















































° 
| Lager 
rs si ih i gc eh eh eles = ag ae 
“oH ‘* pavers beet 
| 
20h 
KS SS SS SA SS I I I I I I 
~60 ~40 20 40 60 
Difference of Visual Acuity measured by Srallez's Type reduced to decimals (Right minus Left) 
Diagram I. 
LATERALITY OF VISUAL ACUITY & REFRACTION IN THE VERTICAL MERIDIAN 
He as 
< 
ail 
3: 
wan ©. --@ Regression of Differences, Refraction on Visual Acuity ; 
O--@ " " ” Visual Acuity or Refraction | 
ood i 
3 
3 ' 
a | 
@ ' 
“Ee ! 
ee -20 
~ 
2 | 
= | 
g Bh: 
a) = ey 
¢ f t f-® e 
= | gq e —— 
g | 2... o* = MEAN | 
as oH ‘, * cbse =. — ee — = 
2 4 | 1 fe) : 
‘3S : d \ 
: ff —— I 
3 t ' 
3 | 
L ' 
2 
= ' 
QA 20h \ 
' 
' 
' 
b+ ' 
| 
| 
\ 
= I r I I = I = I r 5 I I = 
--60 40 ~20 0 20 “40 60 


Difference of Visual Acuity measured by Srellen's Type reduced to decimals (Right minus Left) 


Diagram II, 





a 
- 





efraction in the H 





Difference of I 





4 





T. L. Woo 


101 


These values are all insignificant if compared with their probable errors, and 


don 





1inant for General Astigmatism. 


(5) Correlation of the Differences of the two Refractions. 


accordingly we cannot assert that the eye dominant for Visual Acuity will be 


The data from which the following correlation values have been calculated are 
given in Table VII: 


ps B, = “6642 + ‘0184 (383), 
B, on B,: = °6700, 
B, on B,:  »=°6710, 


TABLE VII. 


n° = 448941, 
n* = 450256, 


me 


” 


‘020888 + -006961, 
7 = ‘018277 + 006521. 


Laterality of Refractions in the Horizontal and Vertical Meridians. 








ian in Dioptres. 
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All these values are significant. They show that an individual with a superior 


horizontal refraction of one eye will also show a superior vertical refraction of the 


same eye. In Diagram III the two regression lines are linear and have been fitted 
with the following equations: 





(a) Probable B, = -64344B, + 08975. 
(b) Probable B,=°68563B, + -03513. 
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(4) Difference of General Astigmatism with Differences of the two Refractions. 

The last two pairs of ocular characters to be studied are General Astigmatism 3 
with each of the two Refractions. The following constants can be deduced from 
the tabulated data: 

B,on B,: 7»? = "027564, 7° =-015228 + 005878 
ns Bs = 0843 + (0337 (394), f ; 4 ; zi ; 
Ba Bs 7 Emel (992), 1B on By: = 025033, 7 = 017766 + 006341, 


B, on Bs: 4? = "017729, # = 015190 + 005862, 
B, on B,: 4? ="051185, # = 020253 + 006751. 


| 3) 


py py = 1039 + 0336 (395), | 


LATERALITY OF REFRACTIONS IN THE HORIZONTAL & VERTICAL MERIDIANS 
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Diagram ITI. 





[t will be seen that the first rg,,, is only 2°2 times, and the second rz, ,, is nearly 
three times its probable error thus showing probably a slight degree of significance. 
Only the last of the four correlation ratios is likely to be significant because here 
the value of *y,,, minus 7,,,, is about 45 times the probable error of 7s, ,,- 
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From the above results we may conclude that there is a high correlation 
between the dominance of the two Refractions and a moderate correlation between 
the dominance of Visual Acuity and that of each of the two Refractions. The other 
correlations do not really give any value of appreciable significance. Therefore it 
is clear from these results that it is not true that the lateralities of any two ocular 
characters are always highly associated. 

(c) On the Quantitative Interrelations of the Dominance of the Manual 
Characters. 

There are fifteen correlation coefficients for all the possible pairs of the six 
manual characters. For convenience we shall study them in groups. 

(1) Difference of Grip (A,) with those of the three Pulling Tests (A,, A, and 
A,), Steadiness (A;) and Balancing (A,). 

The data for the comparison of Grip and First Pull are given in Table VIII. 
We can calculate the following correlation values : 

4, 4q= 1686 + ‘0308 (454), 
A, on A,: »='2624, 7?='068868, 7° =-026432 + 007173, 
3 on A,: »='2605, »?=°067872, 7=°030837 + 007730. 
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The value of 74, 4, 18 small but significant. The two correlation ratios are practically 
identical and are larger than r4,4,; they are clearly significant. The two skew 
regression lines are shown in Diagram IV. They were fitted with cubic equations 
by the least square method and give: 

(a) Probable A, =4°70583 + 263401 A, — 0180623 A,? — -00039726 A.’. 

(b) Probable A,=— ‘00083 + 439087 A, — ‘0187809 A.2 — °000995204.;. 

From the above diagram it will be seen that the enbies for the probable dominance 
of grip to pull and pull to grip are not similar, A maximum 4, of 554610 Ibs. is 
found when A, is 6°07424 lbs. while the maximum A, of 1°81488 lbs. is found 
when A, is 7°36939 Ibs. 

The correlation constants for difference of Grip with those of the remaining 
four manual characters are found to be: 
1 si.4g = 0842 + “0316 (450), {A ,on A,: *?=°0515 = re panes + porciabi 

‘ {A, on £.: : 27° = 038839, WF = °031111 + 007798, 

A,: 7° =°017833, 74%= 031320 + 007851, 
,on A,: »*=°032020, 7 =°031320 + 007851. 
1: 9=017382, 7 =-019608 + 006167, 
A,: °="034848, 7 =-030501 + 007648, 
,on Ay: 7?=°015760, 7 =-019824 + 006233, 
A, on A,: 9?=°034665, 7 = 030837 + 007730. 
The values of the correlation coefficients for the dominance of Grip with the 
Last Pull and Lack of Endurance of Pull are a little less than three times their 
probable errors but may possibly be significant; the last two coefficients are 
obviously insignificant. All the correlation ratios except the first one are too small 


+-~ 


41.4, = °0909 + °0316 (447), 


14,4, = 0480 + 0314 (459), 


V4, 44 = °0525 + °0316 (454), 


> 
s. 


allow of any significant skew association between the dominance of Grip 
cnd the four manual characters here studied. 

(2) Difference of First Pull (A4,) with those of Last Pull (A.) and Lack of 
Endurance (A,). 

The First Pull test is designed to measure the initial muscular strength of 
the hand, while the Last Pull gives a value for the muscular strength after 
thirty seconds continuous pulling; the difference in pulling gives the muscular 
strength lost. Since the interrelations between the dominance of these three 
characters are important we will discuss them in detail. 

The correlation constants are calculated from the data given in Tables IX, X 
and XI and set forth below: 

V4.4, = 2307 + °0301 (452), 
(A, on A;: »='3198, *?=-'102271, #7 = 028761 + 007490, 
{A,;on Ay: »='2752, »?°='075758, 7 = -026549 + 007205, 
1 4,4, = 6058 + 0201 (450), 
(A,on Ay: »=°6127, °='375398, = 031111 +:007798, 
|A, on Ay: 9 =°6059, ?='367056, 9 = 028889 + -007523, 
1 4544 = — 6357 + 0189 (450), 
(A,on Ay: »="6359, °=-404314, = 033333 + 008062, 
(A, on A,: 7=°6351, = -403408, 7 =-028889 + -007523. 
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It will be seen that all the coefficients are large and significant, indicating a 
distinct association between the dominance of initial and final muscular strengths 
and of the lack of endurance of strength in pulling. The first coefficient shows 
that a dominance of one hand for First Pull is associated to a moderate extent with 
the dominance for Last Pull while the second coefficient tells us that the greater 
the dominance of the initial muscular strength the greater the dominance of final 
strength will be decreased. 14,4, with a negative sign indicates, what we might 
expect, that the greater the dominance of Last Pull the less will be the dominance 
of Lack of Endurance in Pulling. 


The three pairs of correlation ratios are all significant and the regression lines 
are shown in Diagrams V, VI and VII. The regression lines of A, on A, and A, on 
A, shown in Diagrams V and VI are slightly skew and therefore will be fitted 
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with cubics while the others being linear have been fitted with linear regression 
equations. These six equations are given below: 


-20-0 


Difference of Stren Sth of 157 Pull iz Pourds 


(a) Probable A, =‘22387A, + °48837 (in terms of A,). 
» »” A,). 


(b) Probable A, = 498724, + ‘17982 sl 


(c) Probable A,=— ‘30691 + 2863514, —‘00113994.? 


— 000645294. ( ,, oop GER 


(d) Probable A,=— ‘530374, + 20147 joe 
(e) Probable A,=+°05952 +°726313A, + ‘01354074, 

— -000022244,?(,, 4 , Ao) 
(f) Probable A,=—*76194.A, + 57460 be) ee 


In order to throw more light on the nature and extent of muscular endurance 
in pulling, it may be worth while to study the correlations of the absolute 
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measures of the two given variates for right and left hand respectively. Tables XII 
and XIIT (p. 113) provide the data for both variates upon which the following 
numerical results are based, 2 lbs. being taken as a working unit for each variate. 














Characters Mean S.D. 
és = ae $$ — | ——________ 
Strength of First Pull for right) : : 
s 8 40-096 + -229 Ibs, | 7-221 + 162 Ths 
hand (d2,) 453 cases as ) Ibs. 7221 + “162 Ibs. 
1 A eo ee a a SS ean 
Lack of Endurance in pulling ai | 15-089+°173 Ibs, | 5°462+-122 Ibs. 
right hand (a4,) 453 cases = 
Strength of First Pull for left) | 39°587 + -207 Ibs. 72044161 Ibs. 
hand (a.,) 458 cases f = 
Se eee On re ee. ee — 
Lack of Endurance in pulling for \ en a 
left hand (a,,) 458 cases 14°369 + °166 lbs. | 259 + °117 lbs. 





= 5639+ 0216, 74,,,a4,= "4279 + 025 


(dap ON Ap) 

n="5829, n?='339803, 79?="022075 +°006800 
(@4n ON A3p) 

n='5727, n*?=*327960, 7?=°024283 + 006892 
(dg, ON M4,) 

n='4598, ?="211451, 92=°019651 +-006179 
(41, OD Gay) 

= "4460, ?='198943, 9°?=*026201 +‘007111 


"aor, TR 


From these results it may be seen that the average strength of First Pull for 
the right hand is about ‘5 lb. stronger than that for the left, and the reducing 
amount after exertion, i.e. Lack of Endurance in pulling, for the right hand is nearly 
‘7 lb. more than that for the left. The value of 100@,/@, (@,= mean of Lack of 
Endurance and @, = mean of First Pull) for the right hand is 37°6 per cent. and 
that for the left 36°3 per cent., which are nearly identical. 

The values of the correlation coefficients in the case of both hands are consider- 
able although they are less than the values of the correlations of their relative domi- 
nances, These high values indicate that the increase of the value of one variate gives 
a proportional increase to the other, no matter which hand is tested. The reason 
may be possibly accounted for thus: Subjects who exert excessive muscular 
power at first pull become exhausted in strength earlier and will be more easily 
affected by a factor of physical fatigue than those who give a lower record at first 
pulling. The two pairs of correlation ratios given here are also essentially large. 
Diagram VIII (p. 114) shows the changes of the two pairs of regression lines for 
each hand. The reader will observe in the Diagrams that the regressions of a, (Lack 
of Endurance) on a, (First Pull) for both hands follow more or less similar straight 
lines. On the other hand, the regression curve of a, on a, for the right hand 
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changes fairly steadily; while the amount of a, gradually increases through a 


range of 


26 lbs. there is a nearly proportional increase of a, which totals 
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about 18 lbs. 


Differerce of Lack of Erduravce of Pull ir Pourds 
Diagram VII. 


On the left, however, as the value of a, increases from 4 lbs. to 


11°5 lbs., there corresponds a constant value of nearly 36 Ibs. in a,. After a, has 


increased up to 15:5 lbs., then the two variates correspond almost proportionally 
with one another in change. 


According to the nature of their curvatures, the regression lines of a, on a, 
for both hands have been fitted with cubic and those of a, on a, with linear 
equations. The equations are given below: 

(a) Probable a,p, = 33°19652 + *202180a,, + °0123439a?,, + 00012010a°,,. 

(b) Probable a,p = *42669a,p — 2°02156. 


(c) Probable a,, = 49°86460 — 3°693951la,, + °3012458a2,, — 00668708a3,,. 
(d) Probable a,, =°31237a,,; + 2°00821. 





T. L. Woo 113 


TABLE XII, 


First Pull and Difference of First Pull minus. Last Pull in Pounds for Right Hand. 


Strength of First Pull in Pounds for Right Hand. 





(Central Values.) 
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TABLE XIII. 


First Pull and Difference of First Pull minus Last Pull in Pounds for Left Hand. 


Strength of First Pull in Pounds for Left Hand. (Central Values.) 





















ais Silrlalalalalnrlaealalalalcle elalnialajalpalir 
~ NININ[ OM Ol Hl olmoi ml Atl Hi ti Mi moepwplywmelywolsol;ol;ol]ol]s | 
a 4 = 
3 |-25|— 3 2 aa l ey ES 2) (SES = 7 
= 0-5 -|— — er l - 2S eS SS ee eS ee 
& | EsI—l- : aol ey ee SS a es eS ee ey ee 
B.A} 35 ff tae BU) OE el SS ee —|—}—|—|—I 6 
=—=2% |] 55/—/1] 1 20h b= Sit = a -}13 
Es) FoI—| LST) Si) 41 31s Bl aS os a) = -135 
ne | OS 211\2/4|6/13| 3}1]/3] 2/3] 2 By ee Ei Tee 42 
4 | Ibs baa) Poe Orie] 21o1 Se) S12 | SSS Se .|—] 76 
AE | 135 —| 1|/—] 5| 4]10)12} 8} 7| 9} 3} 1)—] 3 l —} 1 |—|—|—165 
S| 155]1] 111} 2) 3) 7] 9/12] 7] 7/10) 5) 3] 2) 3] 1)—|—|—|—]1|—|—]1% 
17-5 -|—|1/4]2|] 7] 7}/5|/8] 447) 4/1]/1])3]4/—] 1/—] 1|/—|—[6 
— | 19-5 | =] UL al SiS SL et St Sl Si— 13 6 ae Se ee SS 
Ee |— —| 2] 31 113]1] 3}2]4]5]2)—] 1j—j—|—|—|—|—]27 
S | 23-5] —| 12] 1] 1]—/—] 2]1]2]1/—jJ—] 3]/—|]—]—|—|—|—]18 
ee i Sn Ges Fe es "y Ae fen le OS OE 
S 27-5|—| SS ae ee (ie ee ee a ea, (eee ee el Pe, 
= 29-5 | — | os coe —|—.}—| } Se 8 ee ee ee CU 
oS «| SL-6h— £ : — 1 | —j/—j—|/—]—j—/|—| 1 |—] 2 
S | 33-5]— . % eS . a “ae Se ee ees 
& | 35:5)—| = ais, (Ae. oe : 5 —|—|—|—|—] 1 
5 | 2 | 6 | 12] 19) 31) 39) 65) 48) 41) 44 49) 32/23) 11) 14) 5 |11) 0] 1 SEVERE: 458| 





Biometrika xxA 





‘IITA Mvsserg 


























Sspung ur 11°d jo sdUeInpUyY j? yor] spunog u "dd jo sourInpUy jo 4Pr'T] 
0-05 0-02 0-0} 0 0-0¢ 0-02 0-0} 0 

) 1 it L i l 1 l L l l af l i l 
> i i 
Q 
S | 
= 0-0¢ F | 0-0 
re / 
Sg —1g we 
Ss = : iS = 
~ 4 { S 

& | o& 
‘> & | : 4 
> eee 0-07 &, 
= ‘ f 
~~ 1 } ‘8 
S oy PS 
S ~ = - 
= 2 
Ss 5 3 
D xu 
- ~ 0-0SH OOS & 
~ & 
> 8 5 
Ss + = 
~ LU | Pd ast | lrdish | 
s ! wo a0 soursnpur” jo y9e"] O---O 6) j YO [Mg JO BuKsnpuy Jo YR] O---O 
> l 11"d J? | td 3° 
> 0-09 QNYH J197 z, BTUPINPUT JO OX] UO | is} @---@ GNYH LHDIY 3! aouesNpUY JO YOR] UO [ING ss} @---@ H O09 
S zi 2 





QNWH La1 2 QGNVH LHS YOu 
TINd AO TINVUNGNG dO MOWT B 11Nd ist AO HLSNGULS 


114 






T. L. Woo 115 


(3) Differences of Three Pulling Tests (A., A; and A,) with those of Steadi- 
ness of Hand (A;) and Balancing (A,). 

Here we have to deal with the three differences of the three pulling tests for 
the two hands associated respectively with the differences for steadiness and 
balancing measurements for the same hands. 

As usual all associational constants were obtained from the separate data. 
Their results are given below: 


Product Moment Correlations Correlation Ratios 


(A, on A,: 9°="015549, = 019737 + 006206, 
\A, on Ay: 1?= "043202, 7 = 028509 + 007425, 
(A, on Ag: 9° ="011502, 7 ='019956 + 006275, 


4,4, = 0663 + 0315 (456), 


4,4, = 0695 + 0316 (451), 


|A, on A,: »?=*023020, »? =°028825 + 007508, 
™ i i (A, on A,: 7°='013987, 7 =-019956 + 006275, 
. ont ( (2 Fl) 7 
"4,45 = 0679 + 0316 (451), 1A, on Ay: 4? = 034959, 7 ="028825 + ‘007508, 
(A. on A,: »?='020033, 7=:020270 + 006371, 
l' 45.4, = (0045 + 0320 (444), pee z a *. 399'7¢ 007622 
oa |A, on A;: 9? =°048800, 7 =°029279 + 007622, 
A,on A,: 7? ='016597, 7 ='020000 + 006288, 
tan 182 Gen, SO Ce Le eee 
tea |A; on Ay: 9? ="051382, a =-033333 + 008062, 
A, on A,: n? =°012879, 7 =-020270 + 006371. 
4 4¢ = 0628 + 0819 (444), JAS OR Ast 7 et Le eee 
— A, on A,: n? = "035844, 7 =°033784 + -008169. 


All the correlation coefficients are insignificant. Again; none of the correlation 
ratios can be regarded as of definitely significant value, when compared with 
the corresponding value of #, the correlation ratio for no association. Thus we are 
unable to assert that any important relationship exists between the dominances of 
any pair of these given six variates. 

(4) Difference of Steadiness of Hand and Balancing (A, and A,). 


Lastly, let us turn to the dominance of Steadiness of Hand and Balancing. 
We deduce from the data for the given two characters the following values : 

Product Moment Correlation r4, 4, =*0040 + ‘0315 (457). 

Correlation Ratio of A, on A,: n°? = 036503, 7 = 019694 + 006194, 

a ee A, on A;: 7? =°'018632, 7 = "019694 + 006194. 

Thus, the correlation coefficient is much less than its probable error and, con- 
sequently, of no importance. This seems to indicate that any orderly change of 
dominance in Steadiness of Hand has little influence on that of Balancing. The 
correlation ratio of A, on A, is comparatively large but still hardly significant 
when the difference of »*, 


s4¢ 2nd Fy, 4 
probable error. 


, is considered, i.e. only 2°7 times the 


The foregoing results show that while for certain of the manual characteristics 
there is a considerable correlation between the dominant lateralities, yet for others 
there is no sensible relation at all. 


In other words we cannot assert that if the 
one hand 


is dominant over the other for one character, it will be to the 
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same extent dominant for all the other characters. There appears to be no such 
attribute of universal laterality of the hand. For some characters there is a 
fairly high correlation, for others none at all. An individual may have any degree 
of right- or left-handedness, and it would appear that he may be right-handed for 
one character and left-handed for a second. 


It will be clearly seen from Table IVb that among 15 pairs of manual 
characters the value of opposite laterality varies from 21°11 per cent. to 60°88 per 
cent. giving a mean of 34°52 per cent. That is to say, there is at least one- 
third of all individuals who are dextral in one manual character but sinistral iv the 
second, Even with such a high correlated pair of manual characters as First Pull 
and Lack of Endurance in pulling, there is still more than one-fifth of all persons 
who fail to obey the rule of absolute Manual Laterality. 


Problem 4. Is there any Truth in the Theory of Universal Laterality, 1.e. is 
there any association between dominance in hand and eye? 


(a) On the Relationships in Percentage Occurrences between the Dominance of 
Ocular and Manual Characters. 


Before we proceed to test the numerical relations between any pair of ocular 
and manual characters by the usual correlation methods, we will first study the 
correspondence of percentages between the observed and expected values in each 
category of laterality. The process followed is exactly the same as in the former 
cases. Table XV (pp. 115—125) provides the thirty combined distributions for 
laterality of any two variates of hand and eye. Three kinds of percentages for each 
type of laterality are given in the table. First, those of each cell on the total of 
the table, then of the values to be expected on the assumption of no association 
and lastly, the percentage of the cell frequencies on the column total. We note 
that : 


(1) The differences between the percentages actually observed and those to 
be expected are practically very small. In most cases they only differ in the 
decimal places, which might easily occur owing to random sampling. The largest 
difference is merely 3°53 per cent. (observed = 42°30 per cent. and expected 

38°77 per cent.) in the case of manual dextrality and ocular ambilaterality 
in the Steadiness test associated with the Near Point. It is only about 1°52 times 
the probable error of their difference, and therefore not of significance. It may 
be remarked, however, that this is one of the cases in which the eye is auxiliary 
to the hand. 


(2) When similar comparisons are made bet»veen the percentage in any column 
and the corresponding percentage in the marginal column, we find in 98 cases out 
of 100 the differences between them are insignificant. Those differences which are 
less than 10-0 per cent. in amount seem to indicate no significance. For instance, 
in the case of First Pull and Vertical Refraction (see Table XV, No. 9) in the 
category of manual dextrality and visual sinistrality, the value is 9°77 per cent. 
(63°10—52°33 per cent.) which is just equal to 2°41 times the probable error of 
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Dextrality and Sinistrality of Hand and Eye 


TABLE XV. 


Totals with Percentages observed and Percentages expected on the basis of no Association for 


Manual 
Laterality 
for Grip 


Sinistrality 
Ambilaterality 
| Dextrality 


lo 
Totals 


Manual 
Laterality 
for Grip 
Sinistrality | 
Ambilaterality 
| Dextrality 


| 
Totals 


Manual 
Laterality 
for Gr ip 


Sinistrality 
Ambilaterality | 
| Dextrality 

| 

Totals 


Manual 
Laterality 
| for Grip 


| Sinistr: ality 
Ambilaterality 
| Dextr: ulity 


| Totals : | 


each Category of Manual and Ocular Laterality. 
(1) Grip and Near Point. 


Ocular Laterality for Near Point 


Sinistrality q “Ambil: ater: lity Dex xtrality | ay 
’ } Totals 
eae a ee. ae 


% of Totals } % of Tote als 











| 
| 
| 0, g } | () 
% of |. mi of « {, of Tots ul | of of | 
Obe. | Exp | Cotamen She | Exp. xp. |‘ funn Obs. Exp. >. | ‘olumn| 
| | | | 
13 (3-16) | (2-31) | [19-70] | 35 (8:52) | (8-56) | [14-29]| 11 (2-68) | (3-49) | (11-00) | | 59 (14-36) | 
3 (0-73) | (0-98) | [4:55]! 16 (3-89) (3-62) [6-53]| 6 (1-46)| (1-48)| [6-00]| 25 (6-08) 
50 (12-17) | (12-77) | [75-75] | 194 (47: 20) | (47-43) | [79-18] | 83 (20-19) | (19-36) | [83-00] | 327 (79-56) 
66 (16-06) | 245 (59-61) 100 (24-33) |4il (100) 
(2) Grip and Visual tie: 

Ocular Laterality for Visual ae | 
Sinistrality | Ambilaters lity Dextrality Totals | 

°/ of Tots °/ of Totals % of Totals | % 
6 of Totals | © of 4 of Total: | 0% of ‘o % of Total o% of | 
Obs. Exp. wae Obs. Exp {Column | Obs. | Exp. ‘ olumn | 

nse, | 
26 (6-21) | (6-46) | [13-98] | 11 (2-63) | (2-40) | [15-94] | 24 (5-72) (5-70) | [14-63]| 61 (14-56) | 
13 (3-10)| (2-44)| [6-99]| 3 (0-71)| (0-90)| [4-35]| 7 (1-68)| (215)| [427]| 23 (5-49)| 
| 147 (35-08) | (35-49) | [79-03] } 55 (13-13) | (13-17) | [79-71] | 133 (31-74) | (31-29) | [S1- 1-10] | 335 (79-95) | 
186 (44-39) 69 (16-47) 164 (39-14) | 419 (100 
(3) Grip and Refraction in “ Horizontal” Meridian. 


Ocular Laterality for Refraction in “Horizontal” Meridian 


Sinistrality Ambilaterality Dextrality | Tots 
‘ =: i - ys b | otals 
a - Totals le % of % of Totals % of | % of Totals | ” of | o 
Che: | Exp. Column Obs. Exp. \cofumn Obs. Exp. le olumn 

| — a 
15 (3°84) ) (3-30) | [16-67] | 16 (4-09) | (4-40) | [13:33] | 25 (6:39) | (6-62) | [13-81] | 56 (14-32) 
6 (1-54){ (1-41)| [6-67] 9 (2°30)| (1°88)} [7-50] 9 (2:30) | (2°85)| [4:97]| 24 (6-14) 
69 (17-64) | (18°31) | [76-66] | 95 (24-30) | (24-41) | [79-17] | 147 (37-60) | (36-82) | [81-22] | 311 (79-54) 





90 (23-02) 120 (30-69) 181 (46-29) | 391 (100) | 


(4) Grip and Refraction in “Vertical” Meridian. 


Ocular Laterality for Refraction in “Vertical” Meridian | 


Sinistrality Ambilaterality Dextrality 


% of Totals | % of Totals 70 


| 
| 
; | Totals | 
% of Totals | | 
J 
= 


% of % of % of 
Obs. Exp. Column Ola. | Exp |}Column Obs. | Exp. Column 
13 (3-31) | (3-16) | [15-48] | 21 4 34) | (4:92) | [16-03] | 24 (6-11) (6-68) | [13-4 $I 58 (14-76) )| 
5 (1-28) | (1-20)| [5-95] | 9 (2-29)| (1-86)| [6-87] | 8 (2-03) | 2-54) | [4-4 22 (5-60) 
66 (16-79) | (17-02) | | L78- 57} | 101 2 5-70) | (26-55) | [77-10] | 146 (37-15) | (36-07) [82- 03 | 31: 3 (7 9-64) | | 
84 (21-38) | 131 (33-33) | 178 (45-29) r 393 (100) | 











ils 
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(5) Grip and General Astigmatism. 



























































































| Ocular Laterality for General Astigmatism 
Manual Sinistrality ~ Ambilaterality A " ‘Dextrality | Totals 
| Laterality |___— a, Sree 2a | ae ee 
for Grip % of Totals | -&% of Totals ‘| % of Totals | /o 
PRE 1 Ge. I ci mii anew eet ewe aan _| % of | 
Obs. | Exp. |Column | Obs. “Exp. ~ |Coiumn | Ghana: | Exp. {Column | 
| Sinistrality | 17 (4-09) (40: 25) | ie 7]| 26 (6-27)| (6-87) | [13-40] 18 (4: 34) | (3° 58) | [17 *82] | 61 (14-70) 
Ambilaterality | 4 (0-97) | vi 60) 33] | 15 (3-61) | (2-59) [7-73] | 4 (0-96) } (1-35) | (3 3] | . (5-54) 
|Dextrality 99 (23+ (23-85) | (23-06) | roan 50] | 153 (36-87) | (37-29) | 78.97) | 7 (19-04) (le. 41) U7 [78-22] | 331 (79-76) 
| Totals 120 (28-91) | 194 (46°75) _ | 101 (2434) | 415 (100) _ 
(6) First Pull and Near Point. 
| | Ocular Laterality for Near Point | 
| Manual ae Mine Si A a Cas 9 se —e" | 
| Laterality Sinistrality iS i Ambilaterality Dextrality | Totals | 
| or Of of T - oe. =o a ae, ae i % 
| Birst Pull % of Totals o% of | % of Totals * % of | F % of Tots als | o% of 
| Obs. | Exp. r ‘olumn | Obs. | Exp. SS neies Obs. iE Exp. {Column | | 
| sinistrality 18 (4-44) q (6-14) ) |f27-6 9}* «| 97 (2 3-9: 5) | (22-49) | [40-76] | 40 (9 88) (9-64) | [39-22] | 155 (38- 27) 
| Ambilaterality | 9 (2-22) | tr 39) [13-85] | LL (2-72) | (5-08) | [4-62]] 15 (3-70) | (2:17) | [14-70] | 35 (8-64) 
Dextrality | 38 (9 39) | (8 52) |[58-46] | 130 (32-10) | (31-20) | [54-62] | 47 (11-60) | (13-37) | [46-08] | 215 (53-09) | 
Totals 65 (16- 05) 238 (58-7 7) 102 (25- 18) 405 (100) 
(7) First Pull and Visual Acuity. 
Ocular Laterality for Visual Acuity | | 
| M 2.3 ae ee, - ae? 7 it rag ee 
anual a ae a isan l 
| Laterality nlatealiy ect <4 ses oe i De xtrality a | Totals | 
| o/ "otals ° Totals | | o/ Totals | % 
| m.. Pull __% - ness | % of 6 of Total: % of Re a Totals | © of | | 
| Obs. | Exp. a Obs. l Exp. \¢ oumn | Obs. | Exp. |‘ Jolumn 
| 
Sinistrality | 68 (16-46) | (16-56) | [37-57] | 27 (6-54) | (6-40) | [38-58] | 61 (14-77) an | [37-6 5] | 156 (37-77) 
Ambilaterality | 16 (3:87) | (4:03) | [8 sai] 0.97) | 1:56) | e711 | 18 (4:36)! (3-61) |[11-11]| 38 (9-20) 
| Dextrality | 97 (23-50) | (23-2 53-59] 39 to. 44) | | (8.90) [55-71] | 83 (20-09) | (20-80) | [51-24] | 219 (53-03) 
| Totals | 181 (43 3-83) | 70 (16-95) 16 wh ci 22) | 413 (100) 





(8) First Pull and Refraction in “ Horizontal” Meridian. 


———__-—- 

















Ocular Laterality for Refraction in “ Horizontal” Meridian | 

Manual a -” 2 “ae EOE er as Deena © 

Laterality _ “Sinistr ality . | Ambilaterality | ; ” xtrality | ‘Totals | 

First Pull % of Totals | of | % of Totals o of | % of Totals | o of o 

St _ : ie oO o | 

Obs. | Exp. |Column Obs. Exp. —_ Obs. | Exp. \¢ ofumn| | 

| , | | | 

| } | 

Sinistrality 32 (830) | (8-80) | [35-96] | 49 (12-79) | (11-47) | (42: 24 | 64 (16-71) | (17-59) | [35-96] | 145 (37-86) | 

Ambilaterality 6 (1-57) | (2 00} [6- 74| | 8 (2-09) | 2-61) | [6-90]| 19 (4:96) | (4-01) | [10-67]| 33 (8-62) 

| Dextrality B1 (13:30) | (12-43) | [57-30] | 99 (15-41) | (16-21) | [50-86] | 95 (24-81) | (24-88) | [53:37] | 205 (63:52) 
| Totals 89 (23° 23) | 116 (30-29) | 178 (46-48) 383 (100) 


| 


| 





120 Dextrality and Sinistrality of Hand and Eye 


(9) First Pull and Refraction in “ Vertical”? Meridian. 





| ~ Ocular Laterality for R fraction in ‘ “Vertical” ‘Meridian 
Dees e ee? a ee 
Laterality | Sinistrality | ~ Ambila ateré ality, | ‘De xtrality Totals 
for cia = -_— fk a a ey ma a" a ORES % 
. Totals als | f als — 
x... |  % of Totals | ea 3 % of Tot ls | oo | %° Tot ils | o of 
| Obe. ina. \Column | Obs. Exp. i olumn | Obs. | Exp. “|¢ Jolumn } 








| Sinistrality 7 4 99) | (8-40) | [32-14] | 54 (13-99) | (12-50) | [43-20] | 68 (17-62) | (17: = [38-42] | 149 Gs. 
| Ambilaterality 4 (1-04)| (1-97)| [4-76] 9 (2-33) | (2-94)| [7-20]; 22 (5- 70) | (4:16) | [12-43] | 35 (9-07) | } 
| Dextrality 53 (13-73) | (11-39) | [63-10] | 62 (16-08) sated [49-60] | 87 (22-54) | (24-00) | [49-15] | 202 (52- 33) | 


| Totals 84 (21-76) | 125 (32: 38) | 177 (45:86) | 386 (100) | 





(10) First Pull and General Astigmatism. 








mi Ocular Laterality for General Astigmatism | 
} et ne ee ae EE ee ae SE cccucmeas 
Manual rae | 
| Laterality Sinistrality ‘Ambilate Te ality De xtrality | Totals 
for Ca. |. 2k eae low ce eid a Te, % 
| First Pull | _* ” Totals | 6 cal _- Totals _| % of | _% of Tot als o% of | 
| Obs. | E inp Column | Obs. Exp. Column | Obs. & EB > 9 ici 
| | = — 
Sinistrality 44 (10-79) | (11-0 6) [37-29] | 76 (18-63) (17-62) | [40-43] 36 (8-82) | 156 (38-24) | 
| Ambilaterality | 10 (2-44)| (2°55)| [8-47]| 19 (4-66) | (4-07) | [10-11] | (1-72) 36 (8-82) | 
Dextrality | 64 (15-69) | (15-31) | [54-24] | 93 (22-79) | (24-39) | [49-46] | 59 (14-46) 216 (52-94) | 
Totals 118 (28-92) } 188 (46-08) 102 (25-00) 408 (100) | 
(11) Last Pull and Near Point. 
Ocular Laterality for Near Point | | 
, ene Sinistrality Ambilate rality | Dextrality Totals | 
| | oO s Oo f tals a 9% f tals | Oo 
| Last Pull |_ % of Totals | _ % of nS Total: % of “2 , of Total % of | 
| Obs. | Exp. jColumn Obs. | Exp. Column Obs. Exp. Column | | 
| 
| Sinistrality 26 (6°47) | (6-89) | [40-63] | 104 (25-87) | (25-52) | [43-88] | 44 (10-94) | (10-87) | [43-56] 174 (43-28) | 
Ambilaterality | 6 (1-49)) (1-78) | [9-37] | 30 (7-46) | (6-60) | [12-66] | 9 (2-24) | (2-81)} [8-91] | 45 (11-19) | 
Dextrality 32 (7-96) | (7-25) | [50-00] | 103 (25-63) | (26-84) | [43-46] | 48 (11-94) | (11-44) | [47-53] | 183 (45: 53) | 
| : on” (k0.08 
Totals | 64 (15-92) } 237 (58-96) | 


101 (25-12) | 402 (100) | 


(12) Last Pull and Visual Acuity. 


Oc cular Late srality for Vv isual Acuity | 
a, | Sinistrality -_ ‘Ambilaterality. | Dextrality | Totals 
for — a= a ew  e o -™ ; : x 
| Last Pull | % of a _| &% of % of Potals | % of |_ Leadtetiateicll | &% of /o | 
| Ole. | Exp. ( Real Obs. Exp. | Jolumn Obs. | Exp. — | | 
; | 
Sinistrality | 83 (20-25) | (18-74) | [45-86] | 24 (5:85) | (7-04) |[35-29] | 67 (16-34) | (16-66) | [41-62] | 174 (42 44) | 
Ambilaterality | 22 (5:37) | (5-17) | [12-15] 6 (1-46) | (1-94) | [8-83] 20 (4 38) (4-60) | [12-42] | 48 (11-71) | 
| Dextrality 76 (18-53) | (20-24) | [41-99] | 38 (9-27)| (7-60) |[55-88]*| 74 (18-05) | (ls. 01) | [45-96] | 188 (45-85) 
| : u } | 
| | ~ | } 
| Totals | 181 (46-15) 6S (16-68) 161 (3°37) | 410 (100) | 
_ _ eee 





* An asterisk marke ¢ a . probably significant difference in percentage value, 





als 





8-60) 
9-07) 
2-33) | 


00) 





tals 


+ 
| 


43-28) | 
11-19) 
45-53) 


100) | 
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(13) Last Pull and Refraction in “ 








Horizontal” 


Meridian. 























Ocular Late rality for R efraction i in * ‘Horizontal” Meridian | 
} 
Manual See OS. aes eel ot oe Se A ee ee po 
hatesttity Sinistrality 4  Ambilaterality Dextrality | Totals 
for gee eae Re Bae = cs ieee oF - oo 
Last Pull ~% wien ”, of | % of Tots als | % of |_ % of Totals ! , % of | 70 
j ig | 
Obs. | Exp. “cota Obs. | Exp. S hasta 3 "Obs. | Exp. ey 
Sinistrality | 39 (10-34) | (10-03) | [44-32] | 38 (10-08) | (1: 2-99) | [33: 33] | 85 (22: 55) | (19-95) | [48-57] | 162 (42-97) 
Ambilaterality | 11 (2-92) | (2-78) | [12-50] | 18 (4-:78)| (3-61) |[15-79]| 16 (4-24)| (5:55)| [9 14] | 45 (11-94) 
Dextrality | 38 (10-08) | (10-53) | [48-18] | 58 (15-38) | (13-64) | [50+ 88] | 74 (19-63) | (20-92) | rit | 170 (45-09) 
Totals 88 (23-34) | 114 (30 24) | 175 (46-42) 377 7 (100) 
(14) Last Pull and Refraction in “ Vertical”? Meridian. 
Ocular ntittatniad for Refraction in “ Vertical” Meridian | 
Manual , ee | ae = 
Laterality Sinistrality | ‘Ambilate rality | Dextrality Totals 
for EE ie | Seana = —; meni ae = ‘i % 
Last Pull ee of Tot: als | % of | Peis of Totals | % of |__ % of Totals | , of | 
| Obs. | Exp. |‘ volumn | Obs. | Exp. Column | Obs. | Exp. | olumn | 
Sinistrality 38 (9-90) | (9-28) | [44-71] | 47 (12-24) | (13-54) | [37-90] | 76 (19-79) | (19-11) | [43-43] | 161 (41- 93) | 
Ambilaterality | 14 (3-65) | (2-65) | [16-47] | 14 (3-64) | (3-87) | [11-29] | 18 (4-69) | (5-46) | [10-28] | 46 (11-98) | 
Dextrality 33 (8°59) | (10-21) | [38-82] | 63 sichtie (14-88) | [50-81] | 81 (21-09) | (21-00) | [46-29] | 177 (46-09) | 
= 85 (22-14) | 124 (32: 29) | 175 (45-57) | 384 (100) | 
(15) Last Pull and General Astigmatism. 
| “Ocular ‘Laterality for Ge neré ul Astigm: atism | 
| I cael | Sinistrality | Ambilate ri ulity Dextrality | ‘Totals 
| - “yl all | % of Totals i , | % a Totals | sd F | % of Totals | 2 oa | % 
8 | © Of | 6 ot | } | o 0 
| Obs : | Exp. | ( ioe Obs. | Exp. \¢ olumn | Cha. Exp. Column 
= ; 
| Sinistrality 51 (12-62) | (12-54) | [42-86] | 70 (17-33) | (19-49) (37-84) | | 51 (12-62) | (10-54) | [51- 00) | 172 (42-57) 
| Ambilaterality | 18 (4-46) | (3-36) | [15-13]! 20 (4-95)| (5-21)|[10-81]| 8 (1-98)| (2-82) | [8-00]| 46 (11-39) 
| Dextrality | 50 (12-38) | (13-56) | [42-01] | 95 (23-51) | (21-09) | [51-35] | 41 (10-15) | (11-39) | [41-00] | 186 (46-04) 
wee | | 
Totals 119 (29-46) | 185 (45-79) | 100 (24-75) | 404 (100) 





(16) Lack of Endurance of Pull and Near Point. 





| Manual 
Laterality for 
Lack of En- 


durance of Pul 


| Sinistrality 


| Ambilaterality 


Dextrality 


| Totals 








Ocular | Late: rality for Near Point 


| Sinistrality | 


1| | % a of ' Totals 


Ambilate srality | 


! - | mine: = 7 - 
o;/ 
i 4 of 


p df i Yo of Potals | o% of 1. 
he. * Exp. | Column | Obs. | Exp. \C volumn Obs. 
26 (6-48) | (6-73) | [40-63] | 100 (24-93) | (24-69) | [42-55] 
7 (1:75) | (1-35) | [10-94] | 17 (4:24)| (4:97) | [7-23]] 10 
31 (7-73) | (7-88) | [48-43] | 118 (29-43) | (28-94) | (50-22) 
ee | 
| 64 (15-96) 235 (58-60) | 


% , of Tots ls. 


(2-49) 
49 (12-22) | 


" Dextz ality 


| %, of 


| Exp. | PR al 


| 43 (10-73) | (10-72) | [42-16] | 169 (42-14) 


(2-16) 
(12-56) 


[9-80] 


102 (25-44) 


| [48-04] | 
| 


oy 
/O 


| Totals 
| 
| 
| 
| 


| 34 (8-48) 
198 (49-38) 


401 (100) 














122 


Dextrality and Sinistrality 0 


f Hand and Eye 


(17) Lack of Endurance of Pull and Visual Acuity. 





Ocular Laterality for Visual vaanaiell 





Manual 
Laterality for 

| Lack of En- 
| durance of Pull - 


% of f Totals 


Sinistrality fey 
% ‘of’ Totals 
| 








wan s * Ea 
| S Obs. | £ ixp. | ain Obs. :E :xp. | 
| 
| Sinistrality £ (18-05) | (18-11) | [41-57] | 30 (7-32) | (6-71) | 
| Ambilaterality | '8 (1-95) (3°28) | [4-49]| 6 (1-46) | (1-22) | 
| Dextrality 96 (23-41) | (22-02) | [53-94] | 30 (7-32) | (817) | 
Totals 178 (<2 3° 41) ; 66 (16: 10) 





(18) Lack of Endurance of Pull and Refraction in “ 





Ambilate —— 











ee 











| Manual 
| Laterality for | 
| Lack of En- 


| durance of Pull | 


| Ocular Laterality for Refré rotion in ‘ 
} 


ely | 
_ _ | 


% of Tote als } % of Tote als 





~ Ambilateralit y 








| Dextrality | Totals | 
a Pr % 
of el % % of “Totals | 9 L of | 
{spt Obs. Exp. "SS nunez | 
[45-45] | 67 (16-34) | (16-89) | [40-36] 171 (41-71) 
[9-10] | 17 (4:15) | (3-06) | [10-24] | 31 (7-56) | 
[45-45] | 82 (20-00) | (20-54) | [49-40] | 208 (60°73) 
eel PTE es ud ae 
| 166 (40: 49) | 410 (100) | 
Horizontal” Meridian. 
» “Horizontal” Meridian 
| Dextrality Totals 
— — * 


% of Totals 


























35 (9-2: 


150 (39-58) 





3) 


194 (51-19) 


Totals 
o/ 


Oo 


| 155 (40-68) | 
(9:19) 


35 
191 (50-15 


381 (100) 


Totals 
% 


165 (41-4! 
35 (8-7! 


201 (50-12) 


| 401 (100) 


| | 
of | _.}| % af | Ae adetekobiil %, of 
| Obs. y [= xp. ( com ite. | Exp. \¢ mame Ote | Ex Ixp. jem 
Sinistrality | 32 (8-44)| (9-19) | [36-36] | 54 (14-25) | (11-91) | [47-37] | 64 (16-89) | (18: 48) | [36-16] 
| Ambilaterality | 7 (1-85)| (2-14)| [7-95]|} 9 (2°37)! (2-78)| [7-89]| 19 (5-01)| (4 31} [10-73] 
Dextrality 49 (12-93) | (11-89) | | (55-69) 51 (13-46) | (15-39) | [44-74] | 94 (24-80) | (23-91) | [53-11] | 
Totals 88 (23-22) 114 (30-08) 177 (46-70) [379 
(19) Lack of Endurance of Pull and Refraction in “ Vertical” Meridian. 
Ocular * Laterality for Re sfra vction in “ Vertical” Meridian 
i eel tee Sinistrality = 2 Ambilaterality | Dextrality 
Lack of En- es ‘of aa ee ah oo. 
eo ance of Pull o of Totals | % of fo of Tot: ue | ee ech * — % of 
| Cite: | Exp. |Column | Obs. i Exp. ~| Column | Obs. | Exp {e olumn 
} | | 
Sinistrality 29 (7-61) | (9-18) I3 33-72] 7 (14-96) | (13-03) | [46-72] | 69 (18-11) | (18-47) | [39-88] 
Ambilaterality | 5 (1-31)} (2-07) | [5-81] | "8 (2-10) | (2-94) | [6-56] | 22 (5-78) | (4-18) | [12-72] 
Dextrality ° | 52 (13-65) | | (11: 32) |\[60-47]*| 57 (14-96) | (16-05) | [46-72] | 82 (21-52) | (22-76) | [47-40] 
| 
Totals 86 (22-57) 122 (32-02) 173 (45-41) 
20) Lack of Endurance of Pull and General Astigmatism. 
(20) 
| Ocular minaenacnd for General Astigmatism 
{ a. : Sinistrality | Ambilaterality Dextrality 
a ‘ a s ) : 5 . 
hacacosad Pull % of Totals | o/ of | % of Totals o of | % of Totals °/ of | 
cure 6, 0 oa 4 OL | % 0 
| Obs. | Exp. | olumn | Obs. Exp. ( i Obs. Exp. ( aed 
| 
Sinistrality | 55 (13-72) | (12-31) | [45-83] | 79 (19-70) | (18°78) | [43-17] | 31 (7-73) | (10-06) | [31-63] 
Ambilaterality | 12 (2-99) | (2°62) |[10-00]| 18 (4-49) | (3-98)| [9-84] 5 (1-25) | (2-13) | [5-10] 
Dextrality | 53 (13-22) | (1 00) | [44-17] | 86 (21-44) | (22-87) | [46-99] | 62 (15-46) | (12-25) | | [63° 27)*| 
Totals 120 (29-93) 183 (45-63) | 98 (24-44) 


An asterisk marks a probably significant difference in percentage value. 











| 
| 
| 
| 
| 


3) 
| 


a 


5) 
3) 


>» 





1-71 ) 
(7-56) | 
10-73) 


| 


39-58) 
(9-23) 
1-19) 
100) | 





10-68) | 
(9-19) | 
90°13) 


LOO) | 
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(21) Steadiness of Hand and Near Point. 


Ocular  Laterality for. Nea ear Point 








| | 
Manué ; te 
Ianual | Ambilate rality “De xtrality. 
| 











| Laterality for Sinist rality fs Totals 
| Steadiness — = sare eo, .. : ee (ae << © ite % 
| of Hand % of Totals ar of . % of Totals i? f | % of ‘Totals | o/ of = 
ee eS ‘i Yo be ye | %o s 0 | 
} | Obs. r E xp. Aaa’ ~ Obs. | “Exp. | mel "Ge: Exp. | Column | | 
eee | | ha 
/Sinistrality = | 25 (6-11) | (3-38) |[38-46]*| 42 (10-27) | (12-59) | [17-35]| 20 (4-89) | (5-30) | [19-61] | 87 (21-27) | 
| Ambilaterality | 8 (1-95) | (2-10) [12-31] | 27 (6-60) | (7-81) | [11-16] | 19 (4-65) | (3-29) | [18-63] | 54 (13-20) 
|Dextrality ~ | 32 (7-83) | (10-41) |[49-23]}*| 173 (42-30) | (38-77) | [71-49] | 63 (15-40) | (16-35) | [61-76] | 268 (65: 53) 
Totals | 65 (15-89) | 242 (59-17) | 102 (24-94) | 409 (100) | 
(22) Steadiness of Hand and Visual Acuity. 
| | Oc cular Late rality for Visual ‘Acuity ; | _ | 
l : Manual = Sinistrality “Ambilaterality, | De xtrality L | 
Laterality for | Totals | 
| Steadiness | iT “a fl | = — el ae Baas 
| of tend 4 0 — | % of | a of Totals o% of | % of ma | % of 
Obs. | Exp. |‘ olumn Obs. Exp. \¢ olumn | Obs. | Exp. Column 
Sinistrality 41 (9-93) | (9-13) | [22-91] | 16 (3-87) | (3-52) | [23-19] | 30 (7-26)| (8-41) | [18-18] | $7 (21-06) | 
| Ambilaterality 27 (6:54) | (5-87) | | eel | | 9 (2-18)| (2-27) [13-04] 20 (4:84) | (5-42) | [12-12] | 56 (13-56) 
| De xtrality L11 (26-87) | (28-34) | [62-01 44 (10-66) | (10-92) | [63-77 | 115 27-85) | (26-12) | [69-70] | 270 (65-38 
Totals | 179 (43-34) | 69 (16-71) | 165 (39-95) | 413 (100) | 
(23) Steadiness of Hand and Refraction in “ Horizontal” Meridian. 
| | Ocular Late srality for Re fre UC tion in “ Horizontal” Meridian > | 
: : = 7 : ees 
gee foe | Sinistrality | Ambilaterality Dextrality | Totals | 
Steadiness in | , an P y 
_t _— % of Totals | o of | % of Totals | o of % of Tote als ? f | 7 
‘ he | %O ‘ : % O 4 % of | 
Obs. | Exp. | folumn | | Obs. | Exp. |Column | "Chie: | Exp. Column 
| | | | 
a 
|Sinistrality 16 (4:14) | (4-86) | [17-58] | 25 (6-46) | (6-25) | [21-37] | 39 (10-07) | (9-56) | [21-79] | 80 ( 20-67) | 
Ambilaterality 12 (3-10) | (2°92) | [13-19] | 11 (2:84) | (3°75) | [9-40] } 25 (6-46) (573) | [1907 7] | 48 (12-40) | 
Dextrality 63 (16-28) | (15-74) | [69-23] | 81 (20-93) | (20-23) | [69-23] | 115 (29-72) | (30-96) | [64-24] | 259 (66-93) | 


Totals 91 (23-52) | 117 (30-23) 179 (46-25) | 387 (100) | 


{ 


(24) Steadiness of Hand and Refraction in “ Vertical” Meridian. 


| 
| 








| | Ocular Laterality for Refraction in “ Vertical’ Meridian 
Manual Be a aes ie et ia . ; 
| Laterality for | Sinistrality | Ambilaterality | De xtrality Totals 
| Steadiness Pome r= | ae Wa. are ‘ie ™ ; 
“ a | 6 of Totals fn of % of Totals | o% of | 4 of Totals ca 0% of od % 
} : I : ‘ = 
| Obs: Exp Column Obs. | Exp. \‘ ‘olumn Obs. Exp. |Cotumn 
|Sinistrality | 12 (3-08)! (4:55) | [14-12]| 25 (6-43) | (6-74) | | 000 | 44 (11-31) | (9:53) | (24-72) | 81 (20-82) | 
Ambilaterality 12 (3-08) | (2-81) | [14-12] | 14 (3-60)} (4-16) | [11-11] | 24 (6-17) ( 5°88) [13-4 8 | 50 ie *85) 
Dextrality ~ | 61 (15-69) | (14-49) | [71-76] | 87 (22-36) | (21-49) | [69-05] | 110 (28-28) | (30-35) | [61-80] | 258 (66-33) 





| Totals | 85 (21-85 126 (32-39 178 (45-76) 389 (100) | 
| | (21-85) | (32-39) | | 


* An asterisk marks a probably significant difference in percentage value. 





Manual 7 
Laterality for | 
Steadiness | 
of Hand 








Ambilaterality | 11 
Dextrality 7 


| Totals 


Manual 
Laterality for | 
Balancing 
Test 


«8 ° | 
Sinistrality 





Dextrality 3: 


| Totals 


Manual 
Laterality for | 








Sinistrality | 3 


on 
J 

Ambilaterality | 11 
» 


Dextrality 


and Sinistrality of Hand and Eye 


(25) Steadiness of Hand and General Astigmatism. 








Ocular ‘Later srality. for Gene ral | Astigmatism | 
Sinistrality Ambilate srality .. De xtrality | Totals 
mana 22 —— EE ee a ee ] 0. 
) | | oO | | ) | / 
% of Tots als | % of | % of of Totals o% of ee. ae Totals % of ak) 0 
Obs. Exp. | ( a. Chia. E Exp. Column Obs. Exp. |cotamn| } 
(7-56) | (6-18) | [26-27] | 37 (9-03) (10-00) [19-37] | 20 (4:87)| (5-28) | (19: 80] | 88 (21-46) | 
(2-68) | (3-65) | [9- 32) | 27 (6-58)| (5-91) |[14-14]| 14 (3-42)| (3-12) | [13-86] | 52 (12-68) 
} (18-54) | (18-95) | [64-41] | 127 (30-98) | (30-68) | [66-49] | 67 (16-34) | (16-23) | [66- 34] 270 { 6: (65-86)| 
118 (28-78) 191 (46-59) 101 (24-63) | 410 (100) | 
(26) Balancing Test and Near Point. 
Ocular Laterality for Near Point | | 
Sinistrality Ambilaterality Dextrality | Totals | 
= 0 
% of Totals o% of % of Totals | o of | % of Totals = of | ss | 
Obs. Exp. Column Obs Exp. Column Obs. Exp. |Column 
(5-63) | (4-84) | [34-85] | 72 (17-65) | (17-88) | [29-51] | 27 (6-62) | (7-18) | [27-55] | 122 (29-90) | 
(2:70) | (2-89) | [16-67] Bad gw (10-70) | [17-21] | 20 (4-90) | (4°30) | [20-41] | 73 (17-89) 
(7-85) | (8-45) | [48-48] | 130 (31-86) | (31-22) | [53-28] | 51 (12-50) | (12-54) | [52-04] | 213 (52-21) | 


66 (16-18) 244 (59-80) 98 (24-02) | 408 (100) | 


(27) Balancing Test and Visual Acuity. 


Ocular Laterality for Visual Acuity 


Sinistrality Ambilaterality Dextrality 


| 
| 





Totals 
Ral Ine - | 7 | y 
— % of Totals | of % of Totals | © of % of Totals | °% of | ‘%o 
| Obs. Exp. ( ML Obs. | Exp. ( N.S Obs. Exp. Column 
Sinistrality 48 (11-63) | (12°75) | [27-59] | 25 (6-05) | (5:21) | [35-21] | 52 (12-59) | (12-31) | [30-95] | 125 (30-27) | 
| Ambilaterality | 32 (7-75) | (7-14) | [18:39] | 13 (3-15) | (2-91) | [18-31] | 25 (6-05) | (6-90) | [14-88] | 70 (16-95) | 
| Dextrality 94 (22-75) | (22-24) | [54-02] | 33 (7-99) | (9-07) | [46-48] | 91 (22-04) | (21-47) | [54-17] 218 (52-78) | 
= a 
Totals 174 (42:13) 71 (17-19) 168 (40-68 413 (100 
| ( | (40:68) (100) 
(28) Balancing Test and Refraction in “ Horizontal” Meridian. 
| | Ocular Laterality for Refraction in “ Horizontal” Meridian ; i | 
| _ cee. } Sinistrality Ambilaterality Dextrality Totals | 
— % of Totals of | Ny ‘a Totals | °% of | No a Totals a o/ of | % | 
a a % a | oO == rah! 
| Obs. Exp. Column | Obs. | Exp. {Column | Obs. | Ixp. \¢ ‘olumn | | | 
| Sinistrality 21 (5-36) | (6-90) | [22-83] | 41 (10-46) | (9-05) [33-88] | 53 (13-52) | (13-39) [29-61] 115 (29-34) | 
| Ambilaterality | 26 (6-63) | (4-42) | [28-26]| 20 (5-10)| (5-83) | [16-53] | 28 (7-14) | (8-62) | [15- 64] | a (18-87) | 
| Dextrality 45 (11-48) | (12-15) | [48-91] | 60 (15-31) | (15-99) | [49-59] | 98 (25-00) (23-65) | [54-78 51 | 208 (51-79) | 
| Totals | 92 (23-47) | 121 (30-87) 179 (45-66) | 392 (100) 
! a ee = ees —— Se oo 








t 





21-46) 
12-68) 
65-86) | 
100) | 


7 


7 89) | 





30-27) | 
16-95) | 
52-78) | 


100) | 


tals 


| 
| 
6 | 
| 


0-34) 
18-87) | 
51: 79) | 
100) | 








T. L. Woo 


(29) Balancing Test and Refraction in “Vertical” Mevidian. 





| 
Manual ae <= 
: Sinistr: 
Laterality for | inistrality 
Balancing | ~~ go, eee 
s | ° 1 
Test | % of Totals 
| Obs. | Exp. 
Sinistrality | 22 (5-77) | (6-55) 
Ambilaterality | 26 (6-83) | (3-90) 
Dextrality 34 (8-92) | (11-07) 
Totals 82 (21-52) 
| 
Manual 


Laterality for 


Sinistrality 


% of Totals 


| 
Balancing 
Test 
Obs. Exp. 

|Sinistrality | 34 (8-23)] (8-58) 

| Ambilaterality | 27 (6-54) | (5-19) 

| Dextrality 57 (13-80) | (14-80) | 
: paisa 
| Totals 118 (28: 57) 


their difference 


? An asterisk marks 1 a probably significant diffe erence in percentage value, 


Column 


Ocular Laterality for ‘Refraction i in 





Ambilaterality_ 


“Vertical” Meridian 





Dextrali ty 





% of Totals 


ree en Oe At 
Obs. Exp. | Column 
39 (10-24) | (9-99) | [31-20] 
16 (4-20) | (5-94) | [12-80] 
70 (18-37) | (16-88) | [56-00] 


125 (32-81) 


Obs. 


Exp. 
55 (14-44) (13-91) 
27 (7-08) | (8-27) 
92 (24-15) | (23-49) 


0 Ty ¢ > 
6 of Totals 


174 (45°67) 


(30) Balancing Test and General Astigmatism. 


Ocular Laterality for General Astigmatism 


% of 
Column 


[28-81] 
[22-88] 
[48-31] 


| 
i 


Ambila ater srality 


9 , of ‘Totals — 


Obs. Exp. 
62 (15-01) | (14-18) 
28 (6-78) | (8-57) 
105 (25-43) | (24-47) 


195 (47-22) 


and probably not significant. 


Oo 


Column 


of 


Dextrality 


% of Totals 


Obs. 


Exp. 

28 (6-78) | (7-26) 
20 (4°84) | (4-40) 
52 (12-59) | (12-55) 
WC 4- at) 


% of 


( Jolumn | 


% of 


Column | 


[28-00] 
20-00] 


[52-00] | 2 


Totals 


0 
/o 


| 116 (30-45) 
69 (18-11) 
| 196 (51-44) 


| 
| 
| 
| 





ak 381 81 (100) 


Totals 
0 


oO 


19 


) 
7 
1 


(30-02 >) 
(18-16) 
(: 


4 
5 
4 (51- wie 





| 413 (100) 





There are seven exceptional cases 


(especially marked by an asterisk alongside such values) in which the differences are 


over 10°0 per cent. 


in the case 


and visual sinistrality, and ‘this value being 3 


should be 


treated as 


significant. 


of Steadiness test 


and Near Poin 


t, 


The largest one is 17°19 per cent. 


(38°46—21'27 per c 
in the category of both manual 


ent.) 


‘97 times that of the probable error 
However, these special cases being very few 


in number are possibly due to the effect of random sampling or to the mixed 


character of certain of the tests, and they do not represent any general relation 


between characters in the two groups. 


From the results of the comparison by the two methods just stated, we may 


safely assert that little relation of laterality exists between any pair of ocular and 


manual variates as far as judgment by percentage occurrences is concerned, 


Further, for ease of comparison the actual percentages of each type of laterality 


given in the above 


previous cases, 


laterality. 


namely, 


table are 


divided in a 


manner 


similar 


to that 


used in the 
into three main classes, viz. same, opposite and mixed 
Since the two variates treated are of different groups, it will be more 


instructive to divide the percentages of opposite laterality into two sub-classes, 
but 


viz. 


(1) 


(2) those where the lateralities of these 


those 


which 


are 


dextral 


in 


manual 


two variates are reversed. 


sinistral 


in 


ocular 


The 


cate 


variate, 


vorv 
gory 


of mixed laterality may also be grouped according to the nature of the laterality 


in four 


sinistral 


and 


sub-classes : 


ocular 


(1) ocular sinistral and manual ambilateral, 


ambilateral, 


(3) ocular dextral and 


manual 


(2) manual 
ambilateral, 





9T-F6L-1 





‘FOF 








Z8-19 
Zo-GF 
$o-6E 
P8-LZ 
Ib-L¢ 
IL-9F 
0-88 
6-98 
16-9 
LL-6¢ 
8E-CF 
10-L8 
LO-FE 
FL-0Z 


09-8¢ 


82-LP 
66-9€ 


9F-O1 
c0-9 
C9-LI 
£06 
€F-9 
9F-9 
L8-€ 
Lz-01 
OL-61 
96°F1 
G3 FI 


GEL 


90-91 
TST 
PF'6 

O1-ZE 
L8-9€ 
OL‘:SE 
O€-FE 
€L-él 
O0G-L¥ 








s[BIO I, 


Aqyeriszey | Aztpe19837v] 
-1quy -1quiy 
Iepnog jenueyy 

pue pue 
| Ayrypeayxeq | Azypeayxeq 
[enueyy Ivpnoag 


pue 
Aqrpeaysturg 
jenueyy 


Ayrypesoqepiquy 
pue AzIperyxeq 


Ayypeseye'T poxtpy (TTT) 


Aqyereyzey | 


Aqrpeaqsturg | 


SFE | 06-F 


CT | £9-F STI | ZL: 


8L-9 
bP: 
€ 


aS 
L6°0 
8e'1 
| 
Olé 
€L-0 


-1quy 

jenueyy | 
pue [enueyy 
pue s[vqOT, 


Ay 


cO-€E | S8- 





sk 


ay 


oe) 


IA+R At 


Aytye19} PT 
Iquiy 


Aqrperyxoq 


uoryeIAeg prepuryg 





uve 
(TF) UISI}VUIDISY [vlouexy) puv 
(18g)  ,,[e214eA,, UOovIjoy pue 
(Z6&) .,[v}UOZUOPH ,, uolovljoy pue 
(SIF) Ayimoy [enstA pue 
(SOF) 4 4Ulog IvoN pure 4sayz, Surloueleg 
(OLF) UISIVVULSYSY [vleuexy pue * 
(68€) [89494 ,, UO OBIJoy pue 
(L8&) ,,[@}UOZLIOP ,, UoMovajoy puv 
Aymoy [VnstA pue 
qUIOg IveN pue 4saJ, SSAUTPRayg 
LOF) UISIVVULDISY [eieuer) pue “* 
(18) ,,[¥949A,, UOMovIjoy pue “ 
€) ,.[@}UozUIOF ,, UoMovijoy pue “ 
Ajinoy [enstA pue 
qulog 
ivan pue ([[ng 4seT—-[[Nd 
48117) [[Nq Jo souvinpug jo youT 
(F0F) WSIVVULSSY [vilouex) pue “ 
(48) ,,[B0IeA,, UOORIJey pue “ 
(LLE) ,, [ey UOZMOF ,, UoTNovljoy pue “ 
(OUP) Aymoy [enst, pue “ 
(ZOF) quiog IveN pue [[ng 4sv'T 
(S0F) UISIPVULSISW [vious pue *“ 
(98&) ,,[B01j0A,, UOMOBIJoy pue “ 
(ES) ,.[e}UOZUIOY ,, uoovljoy pue “ 
Aymoy yenst, pue “ 
qulog IveN pur [ng Isilq 
WIST}VULS SY [elouex) pue * 
.,[ROM49A ,, UOTJOVIJEY pue “ 


“ 
“ 


6s 


“<s 
“ 


‘ 


60F) 


(LO) 


}UIOg Ivan pue duy 


Aqrpeazsuig 
(UoljvArasq(¢) SIezORIVY_D FO 





AqipereyeryT aytsoddg (TT) 


Aqyesaye’yT ure 


S 


(1) 


| 


| 








"SLajDIDY Jonunpy pun wnjnag sof hiyyosaywT fo sadhy fo sabnjuaosag 


TAX ATVI 





-86 |11-76 1-02 











+H 
0 
; 2 
= 
+ 
~ 
~ 





oa 








| Standard Deviation 








T. L. Woo 127 


(4) manual dextral and ocular ambilateral. Table XVI ‘provides the classified 
percentages of each type of laterality and their mean percentages and standard 
deviations are also given. On an examination of this table it will be seen that 
the mean percentage of like laterality is 33°02 + ‘72 per cent., or only one-third of 
the whole population. If the mean percentage of same ambilaterality be excluded, 
like laterality is slightly more than the mean value of opposite laterality (2°66 
per cent. more) and much less than that of mixed laterality (11°18 per cent. 
less). This indicates that about one person out of three will probably follow 
the rule of absolute laterality, right-eyedness accompanied by right-handedness 
and left-eyedness accompanied by left-handedness, and that the other two either 
may get the opposite lateralities or be ambilateral for at least one of the pair 
of characters. If the theory of universal laterality were accepted we should expect 
the highest percentages in the categories of the same dextrality and sinistrality 
and the lowest ones or, according to the strict theory, nothing at all in the 
remaining categories. Such a theory is completely out of harmony with the 
results of our present investigation. 

Even if we could disregard two-thirds of the population and consider only 
the group where the lateralities of the two characters related are alike, the mean 
percentage of ambilaterality in this group is still significantly large, being 
11°5 per cent. 

In the case of opposite laterality, the individuals on the average who are 
dextral in hand and sinistral in vision are approximately 4°1 per cent. more 
numerous than those in whom these lateralities are reversed. In the case of 
mixed laterality the mean value of Class (4) is nearly 10°0 per cent. more than 
that of Class (2), while the mean value of Class (3) is only about 1°0 per cent. 
more than that of Class (1). In other words, when we take ocular ambilaterality 
there is a large variation between the percentages associated with manual 
dextrality and manual sinistrality, but when we take manual ambilaterality there 
is only a slight difference between the percentages of ocular dextrality and 
ocular sinistrality. 

The value of the standard deviations for thirty samples of any type laterality 
is large and significant as in the previous cases. In other words, there is no 
even approximate constancy in the percentage values for the different pairs of 
characters. 

(b) On the quantitative Relationships between the Dominance of Ocular and 
Manual Characters. 

We have shown that there is no relationship at all in the percentage occurrences 
of the various classes of laterality for various pairs of ocular and manual characters. 

Let us consider, from the same data, if there be any quantitative relationship 
between them. 

(1) Difference of Grip and Near Point (A, and B,), 


In a previous paper we have shown, on the basis of Galton’s data, that there 


is little relation between the laterality of one manual character, Grip, and one 








Dextrality and Sinistrality of Hand and Eye 





ocular character, Visual Acuity, measured by a test of reading diamond type. 


We have now to investigate the quantitative relation, if any, between dominance 
of Grip and dominance of five various visual characters. We shall first deal with 
the difference between the Grip of the two hands in pounds and between the 


Difference of Distance of Near Point in Inches. (Central Values.) 


TABLE XVII. 


Laterality of Strength of Grip and Distance of Near Point. 








Right minus Left. 
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distance of the Near Point of the two eyes in inches. Table XVII gives the 
data on which the following numerical results are based. They are taken from 


411 cases with the exception of a few incomplete results. 
stants are: 


Product moment correlation coefficient : Y4iB, = 


~I 


) 


+ 0332. 


The correlation con- 
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Correlation ratios: 
Laterality of Grip on Laterality of Near Point : 
14,8, = 014125, 774, p, = 012165 + 005152. 
Laterality of Near Point on Laterality of Grip: 
2, 4, = 025049, 7g, 4, = 034063 + 008524. 

It is quite clear that the value of r4,z,, i.e. the correlation coefficient for the 
given two variates, is too small to be significant. Both correlation ratios are also 
insignificant considering the value of #. Diagram IX shows the two regression 
lines which, considering the indicated values of the standard deviation, do not 


ditfer significantly from the mean horizontals. 
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Differerece of Strerdth of Griz ux Pourds (Righi minus Left) 
Diagram IX, 


(2) Difference of Grip and Visual Acuity (A, and B,). 

As we have already stated the Visual Acuity was measured by the Snellen 
Scale and not by the diamond type used by Galton. We have r4,,, = "0194 + 0330, 
calculated from Table XVIII. This value is quite insignificant with regard to its 
probable error, therefore dominance of grip in right hand is not associated with 
dominance of visual acuity in right eye. Applying the test of the correlation 
ratio the results are as follows: 

4, By = (025313, "724, p, = “028640 + 007768, 


l 
’y4, = 033547, 7p, 4, = 083418 + 008364. 


Both values of »?—% are very small and neither is, therefore, significant. The 
two regression lines in Diagram X are sensibly horizontal as in the previous case. 
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It may.be noted that these results agree very well with those from Galton’s 
data in spite of the fact that different subjects were tested and that the Visual 
Acuity was measured by another method. 


LATERALITY OF GRIP & VISUAL AGUITY 
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Diagram X. 

(3) Difference of Grip and Refractions (A, and B,, B,). 

The dominance of Grip was tested against the Refractions measured in dioptres 
both for the vertical and the horizontal meridians. The data have been separately 
tabulated in Tables XIX and XX (pp. 132—3). We have the following results : 
007694, 74,2, = "020460 + 006821, 
022954, 7p, 4, ="033248 + 008638, 

~ 2» = 015204 74. pn, = 020356 + 006786, 
Y4,B, = — ‘0097 + °0340 (393 cases), \” 4B , ig 7 41 Be ae m ee % 
7 (73.4, = ‘016139, 7B, 4, = 033079 + ‘008594. 


Obviously none of the above constants can be regarded as of significant value. 


Tasm= 0114 + 0341 (391 cases), )" 418s = 
iia (7° B3 Ay 


Il 


The two product moment correlation coefficients have opposite signs but the 
values are smaller than their probable errors and accordingly insignificant. All 
the correlation ratios are also insignificant. From these results we may assert 
that the dominance of Grip of the hands has no relation at all to the dominance of 
the two Refractions of the eyes. Diagrams XI and XII show the regression lines 


for each pair of variates. It will be seen clearly that we could hardly do anything 


better than use the mean lines as regression lines. 
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TABLE XIX. 


Dextrality and Sinistrality of Hand and Eye 


Laterality of Grip and Refraction in the Horizontal Meridian. 


Difference of Refraction in the Horizontal Meridian in Dioptres. (Central Values.) Right minus Left. 
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TABLE XX. 


Laterality of Grip and Refraction in the Vertical Meridian. 


Dioptres. (Central Values.) Right minus 
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Diagram XI. 
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LATERALITY OF GRIP & REFRAGTION IN THE VERTICAL MERIDIAN 
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Diagram XII. 
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(4) Difference of Grip and General Astigmatism (A, and B,). 

The last ocular character to be considered in association with Grip is General 
Astigmatism. The data from 415 cases are presented in Table XXI. This leads to 
the following value: ry.p,=—°0544 + °0330. It has a negative sign and is a 
little higher than r in the case of Vertical Refraction with Grip, but as it is only 
13 times its probable error, it is of no significance. The correlation ratios are : 

14, B= 010156, 24,3, = 014458 + “005583, 

nm, 4, = 035167, 7"y, 4, = 031325 + 008147. 
There are no significant differences between 7? and 7°. The two observation 
regression lines of A, on Bb; and B; on A, are plotted graphically in Diagram 
XIII; they show no significant deviations from the mean horizontals. 


LATERALITY OF GRIP & GENERAL ASTIGMATISM 


























a oo tT 
“S 4 - : = \g 

4 @ ---@ Re§cession of Differences, Astigmatism orGrip [5 

> 

& O---O ” ” ” Grip. on Astifmatisn: ; 

c 1 

“a -2°0 \ 

= 1 

fs ' 

— re) 

wn 

S A 
oA i ‘| 

2 ae 
A i 
ee ae 

oe ! - 

& 1) i 

¢ | “af él 

5 ee eee! 2 _&?. o¢. seen Ot. ee 
ie sooo v F . 

ry ° 
— 1o 
*“B “8 = 
x 1: 
a 4 10 

7 

rc) | 

2 

EY | 
© 

a 2-0} \ 

3 

= 

S | 
A ” A ee ae a 

_ me RSS ATA! TSA, SATS WAT r Sat) Sane 
15-0 “410-0 -5-0 o 50 10-0 15-0 20-0 25-0 


Difference of Strength of Grip ie Pourds (Right minus Left) 
Diagram XIII. 


From the results given above we may safely conclude that superiority of Grip 
on one side is in no way associated with superiority of any of the ocular characters 
tested by us. 

Our next step should be to study the relations of other manual characters with 
each of the above ocular characters. Owing to the limited space at our disposal we 
have not published the other twenty-five correlation tables and the diagrams— 
the character of these is adequately illustrated on Grip—but the results for 
r and »* for the remaining four manual characters and each of the ocular characters 


are given in Table XXII (p. 38). 


[t will be seen that all the values of r are very 
small and insignificant in comparison with their probable errors. For instance the 
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highest value we have obtained, i.e. 74, p, = 0575 + ‘0332, is only 1-7 times its prob- 
able error. The coefficients vary from —°0589 to + ‘0575, half with the positive sign 
and the other half with the negative sign. If we group these coefficients on a 


rough scale we obtain the following frequencies : 


r<—050 —050<—r<——025 — 02% <r <0 O<r—4t 025 +025 <—r—4+050 +°050<r 
Observed 3 6 6 55 15 2 
Expected 4:8 4-5 5:7 57 45 4:8 


It is very interesting to see that the coefficients seem to be symmetrically dis- 
tributed, each lying within the limit of its probable error around the central 
value zero, Although we are unable to represent the distribution of correlation 
coefficients for all possible ocular and manual characters, we have 30 samples of such 
from nearly 400 individuals, and accordingly a distribution like the above hardly seems 
to be accidental. The observed mean 7 = + 0012 +°0045, and the observed standard 
deviation of the samples = ‘0365 + 0032. If we assume the expected mean r=0, 
then in case there were no correlation the standard deviation of random samples 


from the 400 cases = ae = 05, and the probable error of the expected mean r 
V 400 
becomes ‘0061. In other words: 
Expected mean * = 0 + 0061, Expected S.D. = -0500 + 0044. 


Observed mean 7 = 0012 + 0045, Observed S.D. = 0365 + 0032. 


We can see from these results that, if the probable errors are taken into account, the 
differences between the observed quantities and those expected on the hypothesis 
of independence are quite insignificant. There is little doubt that these corre- 
lation coefficients can reasonably flow from a population in which there is no 
association between manual and ocular laterality. We can test this matter in 
another way: here are 30 samples of about 400 individuals; let us ask, could 
the population 7 be zero on the basis of the observed coefficients? The values 
given in the third column of Table XXII are found by the area of }(1—a) 
from the Table of the Probability Integral. They indicate the probable number 
of occurrences in 30 random samples which would lie outside the range + 
observed rv. No single 7 has any high degree of improbability. Thus in more 
than half the cases an r as great or greater would occur over 15 times in 30. 
Further, turning back to the rough frequency grouping above, the y* = 484, 
which for 6 groups gives P = 435, or a reasonable probability for the system as 
a whole. This test again confirms our result. Moreover, not one of the correlation 
ratios shown in this table, either of manual on visual characters or vice versa, 
is significant when compared with the difference of »*—# and the probable error 
of 42. We may also consider the question of the independence of the lateralities 
of ocular and manual characters by testing the system of »*’s as a whole. Suppose 
we take 


= Observed »? — 7? 


Z 


oF 
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—9 


where 7 is the value to be expected on the basis of independence and o% is the 
standard deviation of #, or of samples from a population showing independence. 
Then the values of Z should form approximately a distribution about a mean of zero 
with a standard deviation of unity. The probable error of Z will be accordingly : 
67449 
Vn 
Taking the thirty cases of manual on ocular characters : 
Z = — 1168 + °1231, 
and again the thirty cases of ocular on manual characters : 
Z=— ‘0524 + 1281. 
Thus Z is in both cases negative or the weighted mean 7? is below the weighted 
mean 7. In both cases also the value of Z differs from zero by less than its 
probable error. We may therefore conclude that the whole system of correlations 
between manual! and ocular characters is consistent with absolute independence 
between manual and ocular lateralities. Thus all our investigations of the relation- 
ship of laterality of hand and eye entirely fail to confirm the theory, held by many 
authors, of universal absolute laterality, that is a superior eye determining which 
shall be the superior hand. 

From our study of five different ocular and six manual characters we can 
safely say that there is no association either in coincidence or in intensity between 
the dominance of any character in the eye and any character in the hand which we 
have studied. 


Problem 5. Js either Manual or Ocular Dominance in itself a clear cut charac- 
teristic? Is it possible to assert for every character of both hand and eye that an 
individual is either dextral or sinistral? Is ambilaterality non-existent, or does it 
occupy an appropriate position midway between the easily recognised cases of 
deatrality and sinistrality ? 

The answer to the questions involved in this problem are in a certain sense 
matters of definition, If we measure any anthropometric character of hand or eye, 
we find a continuous distribution of values forming what is termed a frequency 
curve. If we take the difference of characters on the right and left sides of the 
body, whether they be manual or ocular, we also obtain a frequency curve. In the 
characters we have dealt with in this paper, these frequency curves of laterality are 
not bi-modal; we have a continuous curve rising from extreme dextrality through 
a modal value, which may or may not approach the zero value of the variate, the 
curve afterwards falling again to extreme sinistrality. In other words the ordinate 
of this curve at zero difference occupies its proper place in the continuous series 
of ordinates. 

This is illustrated by Diagrams XIV and XV for laterality distribution in the 
cases of Laterality of Grip and Laterality of Visual Acuity. 


Now if we defined ambilaterality to arise when the difference was exactly zero, 


the occurrence of such cases would be practically impossible, if our measuring 
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instruments were indefinitely finely graduated. In other words “ambilaterality ” 
would be solely a dividing line between plus and minus differences. Accordingly, 
any individual would be certainly either a dextralist or sinistralist for every 
character. But such a definition is for practical purposes absolutely idle. With 
a like definition emmetropia becomes merely the border line between myopia and 
hypermetropia, or indeed myopia itself a border line between two classes of myopic 
astigmatism. Yet ophthalmologists make substantial groups of emmetropic and 
myopic individuals. It is perfectly true that these groups, as those of ambilaterality, 
must be largely conventional, depending on the differences we find it of practical 
convenience to disregard. The fact of the arbitrary or conventional size of the 
group of ambilateralists is in the present case of little importance, for no dividing 
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in any proportions of those we have termed ambilateralists can possibly bring 
into anything like accordance all the various percentages found for dextralists and 
sinistralists. We may illustrate this by such a character as Visual Acuity, read 
by Snellen’s types, converting the ratios into decimals. Here the range chosen for 
ambilaterality was that containing differences from — ‘015 to +°015 and the per- 
centages were : 


Dextralists: 39°14°/,; Ambilateralists: 16°47°/,; Sinistralists: 44°39°/, ; 
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but had we taken our range of Ambilaterality from —-045 to +045 we should 
have had: 

Dextralists: 30°31°/,; Ambilateralists: 37:23°/,; Sinistralists: 32:46°/,. 

The distinction between our standpoint and our observations and those of van 
Biervliet is not in the proportion of ambilateralists, buf in the nature of the 
frequency curves for the differences. He asserts an absolute absence of frequency 
for a long range round zero difference; we find a contiz.ous frequency curve 
covering the whole of this range round zero difference, and the number of persons 
to be treated as ambilateralists depends: (i) on the fineness with which we can 
measure our characters, or on the coarseness necessary or profitable in practice, 
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Diagram XV. 
(ii) on the fact that no character in a human being remains absolutely constant; 
the pull of an ambilateralist may be greater on one trial with the left and on 
another with the right hand, and (iii) the personal equation or error of the recorder, 
e.g. it is no use trying to read refractions closer than } dioptre ; for if we did, our 
ambilateralists for refraction would be merely a function of the personal equation 
of the ophthalmologist. Hence in our record of the frequencies and corresponding 
percentages of ambilateralists, we have chosen ranges for grouping, which signify 
that there is practically no difference in the efficiency of right or left side ; it would 
be idle to term a man a dextralist who could on the average pull half an ounce 
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more with his right than with his left hand, or a sinistralist who could decipher 
diamond type a tenth of an inch further off with his left than with his right eye. 

} The chief source of the different percentages found for the various characters 
arises, we believe, from an essential difference in the types of their frequency dis- 
tributions; they may be symmetrical about zero difference, or skewed to the right 
or to the left. In other words if we had an adequate number of observations, we 


. might determine the frequency curve for each character difference, and then inte- 
; grate its areas from zero difference in both sinistral and dextral directions, we 
should thus reach absolute dextral and sinistral percentages only, with no ambi- 


laterality. Short of this there is no reasonable method of truly proportioning the 
ambilateralists, who form the central group on a symmetrical subrange round zero. 
But such a proceeding could not equalise our proportional percentages for all 
ocular and manual characters, and what is more it would destroy the important and 
useful group of ambilaterality, the individuals with practically equal lateralities. 


Problem 6. To consider to what extent the Correlation of Lateralities in two 
Characteristics depends on the absolute Organic Correlations. 

The last question we have to consider is the connection which exists between 
the correlation of the intensities of laterality for two characters and the direct and 
cross organic correlations of those characters. 

Let X =difference of right and left value for first character, 
and :— 2 ol oe SRG is » second e 

Let a, and y, be the absolute values of the two characters on the right side 
and a, and y, of the same characters on the left. Let o,,, cy,, o2,, gy, be the 
corresponding standard deviations. Then as usual we have: 


Vay Fa Fm + Ve Fay F yg — Vxay Fe, 


oS =e Se eee (€,). 


a a ae ra 
Vou, + Fxg — “1 x0 Fx Fxg . yy + oy, — 21 W427 1 F ye 


227%, Fys — Vx 








i 2 dmg 
Thus ryy is known if we ascertain 
(i) the variabilities of the absolute measurements, 


(ii) the organic correlations r,,,, and Tz, y,; 





(11) the cross organic correlations 7z,,, and 1z,y,, 
(iv) the bilateral organic correlations rz,,, and 7y, y,- 


Now as a first approximation we might assume the variabilities on right and 


1b; — : 
left sides to be roughly equal, 1.e. 

on : . 
er, Oz, =z, and ay,=dy,. 
ur Yeiys t+ Vaeye — Vays ~~ xe . 

Hence : Try = _ — Sale “iva _“3¥1 | approximately ........... oneal Gah 
on 21 — 1x, aN 1—Ty,y, 

Cr ° 7 . . . . 

ng But there is some reason for believing that for various organs 
ify 
ild Toye = , (xn TyVe = Trews Vayw2)s x04 — 2 (Trove Turye + Tan x42) eee (€3), 


approximately. 
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Accordingly : } 
Tay, (li — aT ny — $1 xyx2) + Vagus (Lt — Vz 29 — 41% yy) 
= — = 








V1 —1z,2,V1— 


r 442 


Silat =e preerer, eS aes 
= 3 (Tay, + Trey) —— , approximately. 


V¥(1—fre,2,)(1- Tne) 


Further, suppose that the correlations on the two sides of the body are the 
same, i.e. 





Vey, = Tey, = Txy, Say, and the correl: ations of homologous organs are all 

the same, Le. 7z,7,=7y,,,. These things are not far from the truth. For example: 
XX 1¥2 4 

Right and Left Femora =-976 and Right and Left Tibiae = ‘963, middle phalanges 

of middle and ring fingers on right hand = ‘937 and on left hand =-963. Also as a 

rule homologous organs have a correlation of about ‘9. Consequently we should have: 


(1-2 x 4-9) 


Puy ts ei é 
v(1—“9) 


= gy> 

or the correlation of differences would equal the correlation of absolute characters. 
And thus, if, as is probably true, there was a very small or zero correlation between 
ocular and manual absolute characters, we should find no correlation, as we do find, 
between ocular and manual lateralities. 


It seems worth while inquiring where the above relationship fails in the case 
of one or two instances. Is it because right and left side variabilities are not ' 
approximately equal? Or, is it because the formula for cross organic correlation 
is not adequately approximate ? 

The data for three cases are given in Table XXIII on p. 145. From these it 
appears that the right hand is significantly more variable than the left for Grip and 
the left hand significantly more variable than the right for Steadiness. There is no 
significant difference between right and left side variability for First Pull, Balancing 

Near Point. Accordingly only in the third case, and then only neglecting 
differences of the order of the probable errors of random sampling, would it be 
legitimate to use a formula like (¢,) instead of (¢,). 


Formula (¢;) in the case of Grip and Initial Pull gives r,,,,=°4213 and 
leyy, = "4204 as against the observed values ‘4698 and ‘5435 respectively. The 
former might just pass, the latter is too divergent for us to admit (¢,) in this case. 
In the case of Steadiness of Hand and Balancing we have *1299 and ‘1312 as 
against ‘2091 and ‘2769 for r,,,, and rz,,, respectively as observed. 


Lastly for Near Point and Initial Pull we have ‘0239 and ‘0233 as against *0634 
and 0112 respectively. It is fairly clear therefore that we cannot make use of (¢;) 
to simplify (e,). The last column of Table XXIII shows that (e,) gives results in 
reasonable accordance with the direct observation and application of the product 
moment method, but as it involves finding six correlation coefficients instead of 
one, it is hardly likely to be used unless these coefficients are needed for other 
purposes. It may be noted that the cross correlations rz,,, and rz,,, come far more 
nearly equal by (e;), than if they are found by direct observation. 
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IV. SUMMARY AND CONCLUSIONS. 


From the results we have discussed above in detail the following geneval con- 
clusions may be drawn: 

(1) On the basis of our present data it is not true that al! characters of the 
hand and eye are universally superior on one side and that this dominance is 
always of the same definite amount, namely in the ratio of 10 to 9, as van Biervliet 
held in his researches. The superiority of Grip in favour of the right hand is large 
to the average amount of 4°5 lbs., and for Steadiness and Balancing the results 
are about 1°5 times better in the right hand. But for other characters the degree 
of superiority of right side over the left is comparatively small, only measured in 
the decimal place of the unit adopted, or it may even be definitely insignificant. 


(2) There is a large variability in the percentage occurrence of each type of 


laterality for the various characters tested. It has been shown that none of the 
percentages of dextrality agrees closely with the three-quarters dominance of the 
Mendelian ratio, and similarly there is no single percentage corresponding to the 
recessive Mendelian quarter except in the case of Horizontal Refraction which quite 
possibly arises from the fluctuations of random sampling. The frequencies of 
ambilaterality occurring for every character are so large that none of them can 
be regarded as non-existent if its probable error be taken into consideration. 


(3) As far as our data are concerned it is impossible to say that persons 
dominant on one side for one manual or for one ocular character will be dominant 
on the same side for another of the same group, and still more impossible to say 
that they will be dominant on the same side for both groups. The mean percentage 
of same laterality, even if we include the frequency of the same ambilaterality 
either for the manual or the visual group, fails to exceed half of the whole popula- 
tion. That is to say, on the average, more than half the individuals of the total 
population will be dextral in one character but sinistral or ambilateral in others. 


(4) The interrelations of the dominance of the five ocular characters are very 
variable. The lateralities of Visual Acuity and Refraction, taken either for the 
vertical or the horizontal meridian, are associated to a moderate extent (both = *206) 
and are both significant in comparison with their probable errors, The association 
between the laterality of the two Refractions is ‘66, a substantial relationship. There 
are three other correlations which just tend to be significant, two for the cases of 
the dominance of Near Point in relation with that of Visual Acuity and the Hori- 
zontal Refraction, and another for the dominance of Vertical Refraction and General 
Astigmatism. The other four correlation coefficients for the remaining pairs of 
visual characters are too small to be significant. 


(5) The interrelations of the dominance of six manual characters are also very 
variable. We find two high associations with opposite sign in the manual group, 
one is positive in the case of the dominance of First Pull and Lack of Endurance 
(+ 606) and the other negative in the case of the dominance of Last Pull and Lack 
of Endurance (— ‘636). There are significant and positive correlations of ‘17 and ‘24 











T. L. Woo 147 


between the dominance of First Pull with those of Grip and of Last Pull. The 
other eleven correlations are probably not significant and some are lower than the 
values of their probable errors. 


Not only is there a high association between the dominance of First Pull and 
Lack of Endurance in pulling but also two large correlations have been found for 
the absolute measures of these two variates for both hands. It seems probable 
that the correlations between Dominance of Lack of Endurance and Dominance of 
First and of Last Pull are due to spurious correlation as the sign indicates. 

(6) The percentage occurrence of each type of laterality compared with its 
corresponding expected value on the theory of no association for any two characters, 
whether in the manual group or in the visual one, shows a fair correspondence with 
the relationship as dealt with by the correlation method. 


(7) There is little truth in the theory of absolute laterality. It has been found 
that among thirty samples of one ocular character related with one manual one, 
no single value of the associational constants, whether linear or skew, can be 
treated as significant. The observed mean r is (0012 + ‘0045, which is not signifi- 
cantly different from the expected mean r, zero, and this is confirmed by different 
methods of testing. 

The same is true in the comparison of the percentage occurrence for any pair of 
characters of hand and eye. It has been found that the difference between the 
observed percentages and the expected ones on the basis of no association is in 
general insignificantly small, viz. the frequency of laterality in hand and eye is 
quite independent. 


(8) By considering the frequency of laterality for any two characters of hand 
and eye, the absolute theory can also be disproved. The mean percentage ef the same 
laterality shown in the table is 33°02 + “72 per cent. against 26°57 + ‘90 per cent. 
of average opposite laterality and 40°41 + 1:45 per cent. of average mixed laterality. 
In other words, two-thirds of the general population practically fail to follow the 
absolute theory. 


(9) The classification and frequency of the three lateralities are not clear-cut 
characteristics varying with the character treated. They may be affected to a 
certain extent by such essential factors as the magnitude of the unit of measure- 
ment and the form of the frequency for a given trait. No particular person can be 
designated definitely as a dextralist, sinistralist or ambilateralist. He may change 
his type of laterality from one to another character under different experimental 
tests. The frequency of ambilaterality has really a magnitude appropriate to the 
value of the zero difference; it lies in a continuous distribution with a positive 
varying frequency of dextrality on one side and a negative one of sinistrality on 
the other. 

(10) The intensity of the correlation of two absolute characters is not in itself 
acequate to determine the degree of laterality of the same characters. The in- 
tensity of the latter correlation depends on the values of four standard deviations 
10—2 
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and three pairs of correlation coefficients for each side as described above. There- 
fore, the correlation of laterality cannot be predicted merely from the corre- 
lation of absolute measures. 

Our data including eleven fundamental characters of hand and eye with an 
adequate number of cases seem much wider and more reliable than those with 
which van Biervliet and Parson have dealt. We believe that the present material 
can thus form a sufficient basis for solving the problems which have been pre- 
viously raised. But all the results shown are as opposed to the theory of universal 
laterality as we had found them to be in Galton’s data. We venture to assert that 
that theory, tested in many ways in the present paper and shown to fail, cannot 
be accepted on the ground of more sparse data treated by less adequate statistical 
methods in earlier memoirs. It is at any rate far from proven at present. 

In conclusion I should like to express my great indebtedness to the editor, 
Professor Karl Pearson, for his helpful suggestions and criticisms in connection 


with this paper, and to Miss Ida McLearn for her careful preparation of the 
diagrams. 

















ON A METHOD OF ASCERTAINING LIMITS TO THE ACTUAL 
NUMBER OF MARKED MEMBERS IN A POPULATION 
OF GIVEN SIZE FROM A SAMPLE. 


By KARL PEARSON. 


THE problem suggested in the above title is a very fundamental one in statistical 
practice, but I have not seen the current method of dealing with it criticaily 
examined or justified. Suppose a population to number NV individuals, of whom 
p are actually marked by a special characteristic and qg not so; thus N=p+q. 
Now suppose a sample n taken of this population, and that in this sample r 
are found marked and s not so. Then the usual method is to consider the 
percentage of marked individuals to be 


rs 


Y 4 RT-4AC / i 
100 + 67°449 Ne (1), 


and to suppose that the probable error thus determined will measure in a rough 
sort of way the possible deviation of the sample value 100r/n, from the actual, 
and unknown 100p/N. 


I do not think the reasoning leading to this result is satisfactory; first, because 
the result is independent of the size of the population sampled, and secondly 
because it really throws us back on the normal curve as representing the binomial, 
and this will only be correct if r or s be not small as compared with n. 

The problem before us may really be said to consist practically of two some- 
what different propositions : 

(1) We draw a sample n=7 +s, and ask on the basis of this experience, 
what will be the distribution of r’ and s’ in further samples of size n’ = 7’ + s’. 

This problem involves an appeal to Bayes’ Theorem. I discussed it at some 
length in 1907*, in a paper entitled: “On the Influence of Past Experience 
on Future Expectation,” and will only refer here to that paper. 

(2) We draw a sample n=r+s, and ask on the basis of this experience 
what is the likelihood of various values of p and q in the actual population J. 
In other words, knowing r and s, we seek the distribution of p and q. 


This is the problem I want to discuss on the present occasion, and it has to 
be clearly distinguished from (1). 


* Philosophical Magazine, March, 1907, pp. 365—378. 
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If the actual numbers in the sampled population were p and q, then the 
chance of drawing r and s in a sample of , out of N, is 


a |n p(p-1)...(p-—r+ 1) q(q-1)..-(q-s +1) 
— |r |s N(N-1)...... (N —n+1) 


i ie ae 





= _ EO ccdinadenees il). 
ls lp—r la—s ™ 
ree aS £ 

Now the possible values of p are from p=r to p= N—s, corresponding to 


q=N-—r to q=s. Hence, by the theory of inverse probabilities*, the prob- 

ability of p given r in a sample, i.e. C, ,, say, will be proportional to the above 
expression, or writing the constant as C,, will be 
, iP ia 

pr = Cy » | 

Ip-r |q 


C : 
=~ 
my . . . . a5 = 
lhe series accordingly representing the various chances that the actual popu- 
lation contains 7, r +1, 7+ 2... N—s marked individuals is given by 
o lr |N—-r |r+1|N—-r—-1 |r+2|N-r-2 
S,..=C eae he or ene eat Sera +...) 
pr * 1/0 |W —n [1 |N-—n—-1 |2|N—n—2 j 


Goin |N=r (,  rt+iN—n , (+1) (r+ 2) (N-2)(N-n-1) 





i.” 2 a 2 W=nW=r-T) 
ls |V —s) 
+..+5 Se a ie ee (iv), 
\r N-r) 


the series having a finite number of terms. 
Disregarding the constant, we have clearly to deal with a hypergeometrical 
series, wherein 
a=r+1, B=-(N-n), y=-(N- r)) 


ore Vv). 

and y—-a-B—1=—(n+2)=X, say J ) 
We will now determine C, so that S,,, is equal to unity. This involves deter- 
mining the sum of the finite hypergeometrical series in (iv). We can achieve 


this as follows: If a bag contain pM white and gM black balls and we take m 
balls out of the bag the distribution of 0,1, 2... m black balls is given by the 
terms of the series 
pM (pM — 1) (pM — 2)...(pM-m+1) 
M(M —-1)(M-2)...(M—m +1) 








‘ fy + m gM © ak m (m —1) qM (gM —1) 
( 1 pM-m+1 1.2 (pM—m+1)(pM—m +2) 
+ m(m—1)(m— 2) qM (qM — 1) (qM — 2) ‘ 
Reece (pM — m+ 1)(pM—m + 2) (pM —m +3) a : 
and the total sum must of course be unity. Hence we have 
M \pM —m m qM m (m—1) qM (qM —1) 








(M—m pM = 1 pM-—m+1 7 (pM —m+1)(pM—m +2)" sis 


* See Appendix I to this paper. 
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Put a=—m, B=—qM, y=pM—m-+1 and we have, replacing factorials by 
I’-functions, 
P(y—-a-B)U(y)_,, WB , a(atVAR(8+1) 
P(y—a) TP (y—8) lly 2! y(y+ 1) 
This is the sum of the series on the right-hand side supposing that series to be 
finite or converging. 








Now in our special case 
a=r+1, B=-(N-n), y=-(N-?), 
and all the ['-functions on the left have negative arguments. They must therefore 
be raised by ['(n)=T' (n+ 1)/n to positive '-functions. We have 
. 1 
hs nn+1°"n+p—1 
Now let n= — 24, 
C(—-p+e)= iin eas - 
—pte—ptetl +€ 
. (— 1)? T fe) 
~ (p—)(p—e—1)...(—-e)" 





(n+p). 
a, I" (e) 


Now in our case 


a=r+1, B=-(N—-n), y=-(N-r), y-a-—B—-1L=—(n+2)=A\, say, 


y-a—B=-(n+1), y—-a=—-(N41), y-—B=—-(n-7) ...... (vi). 
— T(y—a—B+e)l(y+e) 
: Hence Fae ls 840) 


(— 1°" P(e) x (- 14" P(e) Se 
(n+1—e)(n —e)...(1 —e)(N—r—e)(N—r—e-l)...(1—€) 
(— 1)4*' P(e) x (—1)"" P(e) 
) (V+1-—e¢)(N—-e)...(1—e)(n—r—e)(n—-r—e—1)...(1—€) 
_(N+1-¢)(N-e)...(1—€) x (n—r—e)(n—r—e—1)...(1—€) 
~ (n+1—e)(n—e)...(1 —e) x (N-—r—e)(N—r—e—1)...(1—-€) 





We can now put e=0 and we have 


P(y—a-P)V(y)_ |N+1 |p" 


, TP (y — a) C (y —B) \n+1 |\N-r 





for the sum of the finite hypergeometrical series in (iv). Hence if we choose S,,, 
to be unity 

jak Et eee 
~" |N-n |n+1|N-r 





0 IN—n jn +1 l 
or 9 => TAT 
| 





+1 | ln—9r’ 








or (iv) may be written 
N=r(, +1 N=n, (r+1)(r+2) (N-n)(N-2-)) 


g \n+1 


mr =IN+1 |n—-r tl. iL Nor 2 (N-r(W-r-l) 











scent (iv)*, 
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Turning to (iii) we may write it, since n—r=s, 
? 


ao otti it ee ¢-9 


pr” N+1 Ur |s lp - r \q-s |V : 


or we see from (ii) that 


i.e. the probability of the original population being p is to the probability of 
drawing a sample with r out of a population p as xn +1 is to N+1. Since 


Cyr = Crp/S)- “me C;, p> 
it follows that aes °C,» =(N +1)/(n +1). 


The proof by which the value of C, is deduced may strike the reader as 
artificial, and may raise doubts in his mind. Actual numerical tests will soon 
convince him, however, that 
r+1N—n 4 ti)(r+2) (N —n)(N—n-1) 

1 N-r 1.2 (N—r)(N-r-1) 
js |V—s |N+1|n—r 
pee ae ; 
ee B+lie—-e 


For example, if V = 15, n = 10, r= 4, we have for the series 


1+ 





14 25, 30, 70,35 3 | _ 104 
ibe ke ee Be lke 


but we have 
\V+1 |n—r 116 |6 _ 104. 
Int1[N—-r |lljll 11°’ 

We can now express the moment coefficients of our series by throwing it by 
aid of (v) into the hypergeometrical form, for which these quantities are known. 
Thus: 
a8 a(a+1)8(8+1) 
Lly 2!y(y +1) 

Now if we write \ =y—a—§-—1, then the distance a, y,' of the mean of the 
hypergeometrical series from its first term is 


, ap 


F(a, B, y, 1)=1+ 


and the moment coefficients about the mean are given by* 


_ aB(X+a)(A+ 8B) 


bs 5 | RR a a noha adc (1x), 
_ a8 (X + a)(A + B)(A + 2a) (A + 28) Ga 
Ms a. ee x), 


_ aB(A+a)(A+8) epee 
MQ DA—DA—S) TY CMTE ETH) +1) 


+ A2(3aB8 (a+ 8) + 6 (a+ B)? + Gap) 
+r (3078? + 18a8 (a+ 8))+ 18a?B7} ...... (xi). 


* See Philosophical Magazine, February, 1899, y >. 288—240. 
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Finally, the distance of the mode from the first term will be given by 


; a—1l -1 =: 
Mode = nearest integer to ( et: Se (xii). 
y-a—B+1 
Now it is extremely troublesome to compute the successive terms of a hyper- 


geometrical series which may have several hundred or perhaps thousand terms, 
even if we begin with terms in the neighbourhood of the mean, and compute 
only those of practical significance. It is better therefore to replace the hyper- 
geometrical by its appropriate frequency curve, and in calculating the probability 
of different constitutions for the sampled population, use the areas of this curve. 
Applying (viii) to the above results we have 
(r+1)(N—n) 
n+2 


This result is of considerable interest. In the usual practice the mean would 


Mean 








BAU Saabs te euceuceces (xiii). 


be taken =rN/n. This is only true approximately if r, V and n are large numbers 
and WV very much larger than n. 


Mode = . CP DEY Nekcstsavicco mnie (xiv). 


Since - is less than unity, the mode may be taken as - (N — n), or the mode 
will be greater or less than the mean, according as the number of marked in- 
dividuals in the sample is greater or less than 50 per cent. of the sample. We 
can easily see the influence of these formulae on the usual result by taking a 
numerical example. In a sample of 100, from a population of 1000, 20 are found 
with a marked character. We should then have: Mean as found from the ordinary 
theory = 200, mean as ->.und from above theory = 20 + 185°3 = 205°3, and mode 
= 20 + 180°2 = 200°2. The most probable result of the sampling therefore is that 
the original population sampled contained 200 marked individuals, although 
deviations in excess of this would be more frequent than those in defect. 
We now turn to the standard deviation. This is given by 


(r+1)(n—r4+1) (V+ 2) (NV - n) 
(n+ 2)? (n+ 3) 
Met Bnd oes 1) edveneseaaal (xv). 


4 9 siete 
(N +o 


.,=o = 


geo 24-2 
The ordinary theory would, I take it, suppose the squared standard deviation 
of the original population’s frequency to be best given by 


e rs 


Fg EE | CERIO TRAE ERR RAC eee (xv). 
nn 


which differs very considerably from that of the present theory and does not 
provide for «=0, when n= WN or we take the whole population as our sample. 
It will be clear that when the sample is a small one, the variability of the dis- 
tribution of possible actual populations is very large indeed. 
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In the case of the particular numerical example we have already dealt with, 
where r = 20, n= 100 and V = 1000, we find 
My = 143174526, or o, = 37°8345. 
On the old theory 
fy = 160, or o, = 12°6491. 


Naturally, however, o, will tend to become infinite with N, and what we 
really want is the variability of p/N or better in percentages 100 p/N. Thus the 
two ways of approaching the problem give: 


Mean Percentage 8.D. in °/ 
Present way ..........+ 20°53 3°78 
ik are 20°00 1:26 


° 


which sufficiently emphasises that very different conclusions will be drawn from 
the two methods. Generally from (xv) 





— (r+1)(s+1) 1) (14 2) fy 4 = 
WINN (n +2) (n+2) (v+3) i oe: ae tia 
If N becomes indefinitely large, n remaining finite, this reduces to 
ae Je +1)(s +1) (xvi) 
p[N (m2) (wt Bytes xvi)", 


as against the usual formula 
/*s 
Cyin = me 
Pl \ n , 


The deviation of this from (xvi)"* will be considerable, when we are dealing with 
small samples. 


We now turn to the third moment coefficient in (x) and substitute the values 
from (v). We find 
_(r+1)(8+1)(s—1r)(N—n) (N+ 2)(N-1n+ N42) ‘ae 
iattd (n+2) (n+ 3)(n+4) — | ee 
Clearly w,; will be negative when r>s, i.e. greater than 50 per cent. of the 
sample, but this was the condition already found for the mode being greater than 
the mean or the skewness negative. 


If M, be the pw, for p/N we must divide by N* or 


(r+1)(s+1)(s—7r) n \ 2‘ n—2 eas 
MM. = _ . o XV bis 
2 hab | N/ (1 si w) | N ) as 





or, for sampling an indefinitely large population, whatever the size of the sample, 


M.=2 (r+1)(s+1)(s—r) 


Roeneueveneuye recente (xvii). 


~(n + 2P (n+ 3) (n+ 4) 
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However, for curve fitting, it is the constant 8, = u,*/u.* that we generally use 
and not u;. For the hypergeometrical series in general we have 
_ (A+ 2a) (A+ 28P (A—1) 


B= 7B +a) A+tBa— ors (xvill), 
and this for our particular case becomes 
_ (s—r)y n+3 ( (n +2P = ) To 
A= Ge Det ine att Wem ea) OM) 


a fairly compact form. 
For the special numerical case previously used as an illustration 
B, = 08084, 
which indicates a certain amount of skewness in the distribution of possible p’s, 
but for 20 per cent. of marked individuals, the hypergeometrical series 1s approaching 
symmetry. Turning to », we find that for the general hypergeometrical series it 
may be written 
MeN cee Wega qh tO toe + Fe) 
+? (3aB8 (a + B) + 6 (a+ 8) + 6aB) 
| +2 (3028? + 18a8 (a + B)) + 1802A%}.........(xx). 
Or, again, 
_— aB(A+a)(A +P) 
am VA-—DA—2)Aa—8) 
(A(A+1) + 6a(A+a) + 6B(A+ PB) +3(A+ 6) aB(A+a)(A+8)) 

















r2 
en eres (xay™. 
Hence 
a. =3A+6A-1), MVA- 1) { AAt) 
~ fe — (N—2)(A—3) | (A—2)(A—38) (aB (A+ 4) (A+ B) 
6 6 | ie 
+ BaAdB) aad A Beale (Xx1). 
For our special hypergeometrical this becomes 
8 _ 3(n—4)(n+8) ear er |(6 (n+1)(n+2) ) 1 
* (n+ 4)(n+5) * (n+4)(n4+5) | (N —n)(N + 2)/ (r+1)(s +1) 


am ae 6 (xxi)?* 
(N + 2)(N — n) . 

It seems desirable to apply this to a numerical example, and we will deal with 
that already discussed, in which we supposed a sample of 100, out of a population 
of 1000, showed 20 marked individuals and we required the probabilities of various 
numbers of marked individuals in the original population on the basis of this 
sample. We give first the frequencies of the various values of p*. 

* The probabilities were found in succession, each from the preceding one; this required eight 
decimals; these were multiplied by 1000 and three figures cut off. The work is far more laborious 
than finding the curve in (xxiii), and plotting it. The planimeter will then give any required area 
with ample statistical accuracy. 





Number of Marked Members in a sampled Population 


Frequencies Per Mille of various possible Populations. 
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Using the formulae (xv), (xix) and (xxi)"* for o*, 8, and £, respectively, we 
find the following Type I, limited range curve—where the distance from the 
origin (mode of curve) to the mean, taken as the same for curve and hyper- 
geometrical, is 5°5950—is 


v 19-9076 x 795115 
= 10°5606 (1 “a 
ie ( * ao550) (1 7EeaeTs) 


the origin of the curve being the mode at 1996991. Diagram I shows the 
curve superposed on the hypergeometrical histogram; the accordance is remark- 
ably good. But the labour of fitting the curve after we have found 8, and 8, 
is considerable. Examining the resulting curve we see that the two powers are 
singularly close to the values of r and s in the sample. Let us assume that they 
are such; and accordingly if b be the range of the curve, we shall have 

a,=rb/n and a,=sb/n, 


and for the equation to the curve 


rb " /sb s 
Y= Yo ( n = *) ( ia n) ' 


If M be the total number of possible populations from which the sample may 
have been drawn, usually taken as a thousand, then r and s are integers; we have 


M (n+1)! 
= pen 


riley * 
or the equation to the curve is 
M (n+1)!/rb r “sb . as 
4 . 
y= ——$— $$$ F/ 4 Te tee eeedesoseeaseun XX11). 
y beh rl gs! (= * r) ( r) ( ) 


It remains to consider a suitable method of determining the curve range b. 
The following suggest themselves : 


(i) We might make the range of the curve equal to the range of the hyper- 
geometrical. But the hypergeometrical is zero for p=r—1 and for p=N-—s+1 
or the total range is N—n+ 2. This is the value which should accordingly be 
given to b. 

(ii) Our Diagram I indicates, however, that the practical range of both curve 
and hypergeometrical falls far short of the absolute range. We may therefore get 
a much better result by equating some physical constant of the curve and the series 
which is more important than the range for the useful part of the series. As a 
second possibility we might make the distance from mode to mean the same for 
curve and series, i.e. 

b(s—r) = (N —n)(s—7r) 
n(n+2) , n(n+2) 

This gives b= N—n, which will usually be very close to the result in (i). 
Indeed, had we taken our range from first to last term in the hypergeometrical, 





the two ranges would be the same, and the distance from mode to mean the 
same for curve and series. 
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(iii) As (i) and (ii) are likely to succeed or fail together another mode of } 
considering the matter is suggested. We may equate the standard deviations of 


, ae . B(r+1)(s4+1 
curve and series. Now the standard deviation of the curve is A — ) 


ae (n+ 2) (n+3)’ 
and of the series al dn’ al " e+ hy +2 
(n+ 2)? (m+ 3) 


be =(N + 2)(N —n), I 


ee —n+2 
or b otf eae. 


Hence we have 


7 
This will accordingly give a somewhat larger range to the curve than in (1) } 
or (ii), but has the advantage of making the variabilities in the curve and the 
series the same. } 
(iv) Still a further method was tried, namely, not fixing the powers of the two 
factors of the equation at r and s, but so choosing them that both range and 
standard deviation were the same for curve and series. This, however, was found 4 
to be more laborious, and to give less satisfactory results, than adopting (xxii) } 
and choosing b appropriately. Finally, comparing the results obtained by choosing \ 
b as in (i) from the range or as in (iil) from the standard deviation, we found that 
for a series of curves the latter gave markedly the better result, and that the 
description by means of* ; 


M (n+1)!/rb t /sb 8) 
Y= Ta . + w) ( = x) | ie i 
oot! rls! \n i 5 et Weccecasuanares (xxii), 
where b= v( N+ 2)(N— n) ; 
was really as good practically as if we had used the long formula for 8, and ~,, 
and actually fitted the curve by its first four-moments to the hypergeometrical 
series. We therefore propose to adopt (xxiii) as the practical solution of the 
problem of describing the distribution of possible sampled populations. } 
We will illustrate this by three examples. 
Illustration I. Population of 1000. Sample of 100 shows 20 affected individuals. 


’ 
“quation (xxiil) provides us with the following result : 
Y = Yo (189°9263 + x) (759'7052 — x)”, 
where log Y% = 275°000,407. ) 
Here the mean and standard deviation of series and curve are identical, and 
it takes a tenth of the labour to write down the equation to the curve that the j 
fitting by moments takes. J¢ was not, however, possible to draw this curve on 
* For some purposes, for example, in determining the modal ordinate, it is convenient to write the 
equation to the curve in the form 5 


_M (n+1)! 71" 8 n x\" s = \* 
y=> n® ca(its i) (1-85). 


M (n+1)! 77 s8 
Ino. = ’ ,? 


J(N+2)(N—-n) m™ ris! 


Thus the modal ordinate 


and can be easily calculated logarithmically from tables of the log. factorials. 
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Diagram I, for it could not be shown as distinct from the moment curve on the 
scale of the diagram. We note the following results : 


Mode of 
Populations Range 
Actual Series aa wats ae — 200°20 900 
Curve fitted by Four Moments _... wale 199-70 946°5 
Curve fitted by assuming powers of factors 199°7] 949°6 
to be r and s 


It will be seen that the mode is slightly out of position for both curves, but as 
the modal number can only be a whole number for all populations, it is 200 for 
all three cases, ic. the number in the sample increased up to the size of the 
population. The curves give good fits to the bulk of the practical part of the 
series by extending the range beyond its actual value, but in this extension they 
provide no practically sensible frequency. 

illustration II, Population of 100. Sample of 40 showing 10 affected in- 
dividuals. 


The two curves have for their equations 
Curve from Four Moments: 
a“ 


. \ 88616 4 x 26-1827 
issi03) ( ‘ sra008) ‘ 


y = 75°5688 (1 + 
Curve from Equation (xxiii) : 
y ="8177 x 10- (195576 + x)” (58°6728 — x)”. 
The histogram of the hypergeometrical series is shown in Diagram II, but the 
two curves so nearly coincide that they cannot both be represented on a diagram 
of this size, It is clear that (xxiii) gives a perfectly adequate representation of 


the hypergeometrical series*. We note the following results : 
Mode of 
Populations Range 
Actual Series bi me see 25°25 60 
Curve fitted by Four Moments ... 24°74 732 
Curve fitted by Equation (xxin)... 24°78 78:2 


The mean of the series is 25°714. It is clear that the modal value to the 
nearest whole number—which is all that can occur—is 25 for the series and both 
curves. It is further evident that the goodness of fit for the curves which have 
the same mean and standard deviation as the series is attained by prolonging the 
range into regions, which, if theoretically impossible, still do not for practical 
purposes provide any sensible frequency. 

* If we make the range and standard deviation the same as for the hypergeometrical series, but let 
the powers in the equation differ from r and s (Method (iv)), we find 

y ='377787 x 10-* (14°4327 + 2)°96°7 (43°5673 — x)16-9814, 
which gives an excess of frequency at the tails and a deficiency at the mode. The curve does not give 


a bad fit, but it is inferior to either of the two considered above. The same was the case in Illus- 
tration I when Method ,iv) was applied. 
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Diagram II. 
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Illustration III. Population of 30. Sample of 10 showing 2 affected in- 
dividuals. 

This illustration was taken to indicate what the power of description of the 
curves might be in still more extreme cases. The equations are : 

Curve from Four Moments: 


x“ 71 £ 6-9367 
y = 131-051 (1+ F718 (1 54170) 


Curve from Equation (xxiii): 


/ “x = x \ 
= 131°309 al : 
y = 131°3094 © +: =a500) (1 20-9386) 


Diagram IIT shows both curves, which with small samples from small popu- 
lations differ visibly in a diagram of this size. But the deviations of one from 


SAMPLE OF 10 SHOWING 2 AFFECTED INDIVIDUALS 
DISTRIBUTION OF 1000 PoSssIBLE SAMPLED POPULATIONS OF 30. 
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Diagram ITI. 
the other and both from the histogram are not of a kind to be of any real 
importance in practical statistics. In this case the four-moment curve gives the 
somewhat better result. 

The following Table giving the terms of the hypergeometrical and the ordinates 
of the two curves will enable the reader to judge the degree of accuracy with 
which even in an extreme case like the present the curves can be used to replace 
the series. 


Biometrika xxA 11 
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Distribution of 1000 possible Populations of 30, when a Sample 
of 10 has shown two Marked Individuals. 





Actual Hyper- 
geometrical Series 
Value of Terms 


Four Moment 
Curve Ordinates 


Curve Equation 


(xxiii) 
Ordinates 











Marked . a 

Individuals [1°5 13°40] 
4 36°71 34°68 

3 78°66 78°65 

i 110-70 111°27 

5 127°74 128°25 

6 130°29 130°02 

7 121°60 121-28 

8 105°74 105°34 

9 86°52 86°23 

10 66°95 66°84 

11 49°09 49°26 
12 34°11 34°25 








Total 999°98 [At Mode 5°6475 131°05 





~ 
— 
bo 
+ 
bo 
Pp 


It is worth noting in this case that the hypergeometrical series has only a 
slight degree of leptokurtosis (8, = 307976) but a considerable degree of skewness 
(8, = 37364). We have the following results, the mean of the series and the two 


curves being 7. 


[15 15°95] 
34°89 
76°41 
109°47 
127°61 
130°87 
122°38 
106°48 
87°00 
67°16 
48°98 
33°66 
21°81 
13°05 

7°85 
41°19 
2°04 
“89 
*B4 
‘11 














Mode of 


5°7350 131°31] 














Populations Range 
Actual Series ae Sad 6:2 20 
Curve fitted by Four Moments 5648 23°83 
Curve fitted by Equation (xxii) 5°735 25°30 


Again we see that the mode as a whole number in the series and the curves 
is 6, and corresponds to the number of affected individuals in the sample, 1.e. not 
the mean but the modal value in the sampled populations corresponds to the 
sample proportion. It will be further noticed that the goodness of fit of the 
curves to the series (not so good now at the start of the hypergeometrical series) 
is obtained by extending the range. This extension is of no importance practically 
on the right of the curves (see Diagram III), but it is of some importance on the 
left, where the curves instead of starting at 1°5 of the histogram of the hyper- 
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geometrical series start at 0°932 and 0675 respectively. The small frequencies 
below 15 must be thrown into the frequency centred at 2 in such cases. It is 
possible that in this case our Method (iv) might give a closer result at this end 
of the series, but the general result is so satisfactory that it seems best to 
maintain a single method for all cases, namely, that. of Equation (xxiii). This is 
so simple in application that it is preferable to the four-moment curve, although 
the latter gives a slightly better fit to the histogram. 


Besides the importance of the present problem itself as providing a measure 
of the manner in which the sampled population may differ from the sample, our 
curve enables us readily to provide an asymptotic expression for any number of 
terms of a finite or of a converging hypergeometrical series, which may possibly 
be worthy of the consideration of the mathematician. 


While attention has been frequently paid to the problem of future expectancy 
on the basis of past experience I do not know of anyone but Laplace who has 
attempted the present problem on the basis of inverse probabilities. In his 
treatment certain difficulties arise which I have considered in Appendix I to 
this paper. 


I have to thank very cordially my colleagues Miss Margaret Moul and Miss 
Elizabeth Worthington for much arithmetical aid in this investigation of a simple 
curve to represent the hypergeometrical series. This aid is not fully represented 
in this paper, as several variations of Equation (xxiii) were tried from which the 
most satisfactory could only be picked after all the curves had been fitted to the 
series. 


APPENDIX IL. 
NOTE ON THE THEORY OF INVERSE PROBABILITIES. 


I am fully aware of the criticisms which have been raised on the use of the 
theory of inverse probabilities: see for example Venn’s Logic of Chance, Chapter 
vil, § 9—18. But let us look into the actual needs of statistical science in 
ordinary practice. It is rare in anthropometry, especially in craniometry, in 
psychology or in medicine that a complete population is measurable; we must 
content ourselves with a sample. According to Dr Venn we cannot infer from a 
known sample anything as to the population sampled. We may draw conclusions 
for our sample but for nothing else. If we compare two samples of crania, one 
from Tibet and the other from Japan, we cannot assert anything about the 
difference between Tibetan and Japanese races as such, because we must not use 
the theory of inverse probabilities te determine some more or less approximate 
measure of the divergence of the populations from their samples. Or, may we be 
allowed to do so on the same ground as Dr Venn allows theory to be applied in 


the case of two samples of medical treatment to determine which in general is 
more efficient ? 


11—2 
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That is because: 

The problem discusses simply a balance between two extremely similar cases, and there is a 
certain set-off against each other of the objectionable assumptions on each side. Had one set of 
experiments only been proposed, and had we been asked to evaluate the probability of continued 
repetition of them confirming their verdict, I should have felt all the scruples I have already 
mentioned. But here we have got two sets of experiments carried on under almost exactly 
similar circumstances, and there is therefore less arbitrariness in assuming that their unknown 
conditions are tolerably equally prevalent. (Logie of Chance, Edn, 1888, p. 187.) 

The writer seems here for a moment to realise that nearly our whole conduct 
in life depends on judging from samples as to sampled populations*. If past 
experience is no guide to future happenings, no man could get on in life, and the 
fact that every insurance company does judge from past experience to future 
happenings, and succeeds in so doing, seems to preclude us from describing all 
hypotheses by which we pass from one to the other as “obtaining solutions about 
matters on which the human intellect has no right to any definite quantitative 
opinion.” It is difficult to understand why if we may not apply it to one popu- 
lation we are justified or partially justified in applying it to two in the above case 
of medical treatment. Dr Venn seems to suggest that any hypothesis applied to 
both would lead to the same result. I see no reason for this whatever. I think the 
true answer is this: that we have learnt by wide experience that a sample does 
within broad limits describe the population sampled. Otherwise the sale of corn 
and coal and oil in bulk would be impossible. We know further from experience 
that the larger the sample the more accurately it describes the population sampled. 
These are not hypotheses, but the conclusions of wide human experience, without 
which not only the work of every practical man but of every scientific investigator 
would be idle. The question then is whether we shall assume the sample actually 
describes the population sampled+—although we know by experience that it 
actually does not—or consider all the possible populations from which the sample 

might have been drawn. If we adopt the latter course, we can certainly ask 
whether some of these populations are more likely to have occurred on the present 
occasion than ethers, and weight those populations accordingly. But to do this 
is not to appeal to some extraneous knowledge, any such ought @ priori to be 
introduced into the statement of the problem. Such extraneous knowledge usually 
amounts to stating that we have directly or indirectly sampled this or a like 
population previously, and accordingly is really equivalent to an increase in the 
size of the sample and can be allowed for in that way. Given the possible 
populations and no extraneous knowledge we have to consider how they should 
be most reasonably weighted to give the observed result. It is difficult to conceive 
any more reasonable weighting, based on a single experimental result, than that 
which weights them with the probability that if they had existed they would have 
P 


Dr Venn would justify the comparison of two medical treatments, but would not allow an 
argument as to a new treatment, where there had previously existed no treatment at all, e.g. say 


g 
for Sleeping Sickness, twenty years ago. 

+ Insurance companies usually adopt this method, and I presume justify it on the ground of the 
size of their samples. 
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given the observed result*. Had we in any particular case knowledge that the 
population sampled belonged to a particular class of populations which had 
marked individuals with a certain frequency distribution that would be another 
matter, but this knowledge is supposed excluded by the statement of the problem; 
we have to weight by our single sample. As I have said, such extraneous knowledge 
usually amounts to increasing the size of our sample. For example, if previous 
experience gives 7 marked individuals, and s’ unmarked, then r+7r’ and s+s' 
should be used instead of 7 and s, and if 7’/s’ is much greater than r/s there will 
clearly be, on the hypothesis of inverse probabilities, a great shifting of the dis- 
tribution of possible populations. I have endeavoured to allow for some @& priori 
knowledge of the type of populations sampled in the case of determining the most 
suitable value of a correlation coefficient in a sampled population, when we have 
some knowledge of the coefficient in similar populations (Biometrika, Vol. X1. 
pp- 352—360). But where our knowledge is based upon a single sample only, and 
this however large, we must either decline like Dr Venn to make any use of this 
knowledge—in which case practical life becomes impossible—or we must appeal to 
the principle of inverse probabilities to give us some measure of the accuracy with 
which we are describing the sampled population. 

Dr Venn and some other writers assume that the possible populations might 
be distributed according to some other law of frequency than that demanded by 
the law of inverse probabilities. For example, let f, be the frequency of popu- 
lations with ¢ marked individuals, and let S(/,)=1. Then these authors next, 
without noticing their assumption, suppose that Nature in the particular case of 
the sampled population presented to us has drawn at random from this bag of 
sampled populations. But why should Nature have drawn indifferently from such 
a bag? She may have drawn with a bias represented by the probability q of 
coming from f;. If so the probability of drawing a sampled population with ¢ 
marked individuals is not f,, but f, x q, and the only reasonable measure | 
personally can give to this product of two unknowns is to be based on the fact of 
the observed single sample and on the probability p,, of drawing r marked in- 
dividuals in that sample from a population with t We thus reach p,,=ji X 
and we see that the weighting q will be directly proportional to p;,, and inversely 
as f;. The dogma that the sampled population has been selected in the course of 
its formation from a frequency distribution S(f;) seems to me far more unreason- 
able than the hypothesis that we must weight our possible populations with the 
probabilities that they would give the observed result, the only result we know. 
We can if we choose appeal to the Edgeworthian doctrine} that it is a broad 
result of experience that populations with any number of marked individuals 

* It has been objected to the use of inverse probabilities that it supposes the existence of populations 
which we may be fairly confident do not exist; the frequency of such populations as given by the 
theory will, however, be indefinitely small, and need trouble us no more than the indefinitely small 
frequencies of men 20 feet high or of zero stature do, when we represent by a normal curve the 
stature distribution of a population; or again than the infinite values of the mean and standard 


deviation which theoretically arise when we draw samples from a normal population. 
+ Mind, Vol. tx. pp. 223—235, 1884. 














166 Number of Marked Members in a sampled Population 


from none up to all are equally likely to be found. It is possibly a reasonable 
doctrine for a considerable portion of our frequency curve of populations, but 
I am inclined to think that the populations with none and all, giving probabilities 
equal to certainty, would be found to be lumped up. Let us suppose for illus- 
tration that a mark was made on our scale of skin pigmentation and the number 
of individuals in a particular race with a darker pigmentation was found to be r, 
and without knowledge of other races, we wanted to predict something about 
the percentage of dark individuals in the sampled population. Now clearly with 
such a method of estimating dark individuals in a race, the frequency distri- 
bution of populations would not be a horizontal line; there would be an excessive 
lumping up of the white and dark populations at either end of our curve of 
population frequency. It is true that some of this is effectively cut off by the 
knowledge that there are at least » marked and s unmarked individuals in the 
sampled population, But I think it more than reasonable to argue that the 
formation of the observed population was not equivalent to a random selection 
t=n-s 
from the remainder S(f;) of such a frequency; we are justified in supposing a 
AA 

bias or weighting to have occurred, and the most reasonable weighting to adopt 
is that of choosing the frequencies to be such that they are proportional to the 
probability they give of the observed result. Thus I do not think it is necessary 
to appeal to an @ priort equal distribution of populations, but to a reasonable 
weighting of any distribution, with a not necessarily random selection. The 
“random selection” is as much a dogma as to the formation of the sampled 
population as is the assumption that possible populations are @ priori equally 
probable. We choose the frequency from which we are to make a random selection, 
not to be a straight line, but to be the one most probable on the basis of the 
sole event observed. 


APPENDIX II. 
NOTE ON A SOLUTION DUE TO LAPLACE. 


Towards the end of the 18th century the question of a census of the popu- 
lation was becoming a prominent question in the greater European countries and 
an attempt to solve it for France by records of births and by test samples was made 
between 1783 and 1789 by illustrious scientists. In the Mémoires de l Académie 
des Sciences, published in Paris, 1786—1791*, will be found for the following 
years, 1788, pp. 703—718; 1784, pp. 577—593; 1785, pp. 661—688; 1786, 
pp. 7083—717 ; 1787, pp. 601—610; 1788, pp. 755—767, portions of a memoir 
entitled: Essai: Pour Connoitre la Population du Royaume & le nombre des 
habitans de la Campagne, en adaptant sur chacune des Cartes de M. Cassini, V année 
commune des Naissances, tant des Villes que des Bourgs & des Villages dont il est 


* The Mémoires came to an end with the volume for 1789, issued in 1791, and were not resumed 


until after the Reign of Terror, Robespierre having proclaimed that there was no further need for 
men of science, 
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Jatt mention sur chaque Carte; présenté & V Académie. Par M™ du Séjour, le 
Marquis de Condorcet & de La Place. 


The object of the Essai was to provide an estimate of the population of France 
and its geographical distribution. The method employed does not get except, 
perhaps, in accuracy of estimate, beyond that of old John Graunt; namely, it 
consists in determining for a sample the ratio of births in a year to the population 
producing those births, and then ascertaining the number of births in a year in 
each urban and rural district in France and multiplying by the ratio of population 
to births determined for the sample. This ratio was taken as 26 for the towns 
and villages but as 30 for Paris. It will be clear that we have here a definite 
case of proceeding from a sample to the population sampled. There would be 
little interest now in this attempt to take 2 census of France on John Graunt’s 
lines were it not that a memoir by M. de La Place is affixed to the first issue of 
the joint memoir. It occupies pp. 693—702 of the Mémoires for 1783, and is 
entitled: Sur les Naissances, les Mariages et les Morts a Paris depuis 1771 jus- 
quen 1784; & dans toute l’étendue de la France, pendant les années 1781 & 1782. 


The title gives no idea of the contents of the memoir, which may be defined as 
follows : 


Supposing a statistical ratio to be determined from two quantities 2, and y, 
by means of a sample from the whole, and we know one of these quantities 
X, for the whole, what error are we likely to make in estimating Y,, for the 
Ys ? 


whole as equal to X, x 


Now it is clear that the variability of Y,, will depend on three things, namely, 
on the variability of y,, the variability of #,, and thirdly on the correlation of 
x, and y; in samples of the same size. That is to say, the variability of sample 
populations, of sample births and the correlation of the size of population with 
number of births. 


Now how does Laplace treat this problem and obtain a measure of the error 
introduced by his sample into the estimated population of France ?* He applies 
an extension of Bayes’ Theorem to predict from a first sample p, gq observed, what 
a second sample with known q’ but unknown p’ might produce. There seems to 
me here something questionable. His first sample is a sample of the whole 
population of France, it is a part of a limited population; what right has he to 
consider both to be samples from a third, an infinite population, and treat them 
as if they were independent samples? His sample of the population is roughly 
one-thirtieth of the total population and this in itself limits the range of values 
that it is possible to give to p’ in the total population, 


To understand this criticism it is desirable to state the practical problem with 


which Laplace wishes to deal. Using a factor of 26 or 26-5, as the case may be, 


* He uses the same method with a slightly different notation in his Théorie Analytique des 
Probabilités, pp. 391—394. Ed. Paris, 1812. 
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ascertained from the sample population as the ratio of population to annual births, 
we determine the total population of France, from a knowledge of the 973,054 
annual births, to be 25,299,417 or 25,785,941 according to the factor used. 
Laplace asks what must be the size of the sample in order that there may be 
a very great probability that we are not more than 500,000 persons wrong in our 
estimate of the total population. Laplace—like many of his countrymen—seeks 
for a corresponding urn problem as a means of finding the required solution. He 
writes (p. 696) in effect as follows : 


Let us consider an urn which contains an infinite number of white and black 
balls, and suppose that in a first drawing we have obtained p white and q black 
balls. Next let us suppose that in a second drawing we have obtained q’ black 
balls, but that we do not know the number p’ of white balls drawn with them; 
the method which presents itself for determining this number in an approximate 
manner is to suppose it in the same relation to g' as p to qg, which gives us: 
p =pq/q for the number of white balls. Accordingly we shall determine the 


/ 


sa , ae ( 
probability that the true unknown number lies between the limits p ! (1 —w) 
( 


and p d (1 + w), or what is the same thing that the error of the result pq’/q does 
q 


tf 
not exceed p l wy, 


qY 


But it appears to me that Laplace’s urn statement does not in the least fit his 
actual problem. We are not taking a second sample from an infinite population. 
We have only one sample and we want to learn something about the population 
from which it has been sampled, which is finite in extent, although its extent is 
unknown. We do know, however, that it contains q’ white balls, ie. births in all 
France. There appears, however, to be a slight difference between the treatment 
in the Mémoires and in the Théorie des Probabilités. The latter appears to deduct 
the births from the population, and treat the remainder as the varying quantity; 
in the former it appears to be the total population without deduction of births*. 
Whichever may be supposed taken, let us term them p, and p,’. 


Now Laplace takes an urn in which there are an infinite number of black and 
white balls and puts the chance of drawing a white ball as z. This he is going to 
consider as the chance of a birth, and the chance of q’ births will be a”. Now if 
for a moment we forget his urn and suppose the chance of an additional unit of 
population to be y, the chance of drawing p,' units will be y”’, and if we suppose 
the units of birth and population to come in any order then the chance of q’ 
births and p,’ population will be 


[T+ op 
lq’ Ip ™ ¥y * 


* Cf. pp. 697—698 of the Mémoires with p. 392 of the Théorie, but in both cases I venture to 
think the treatment obscure. 
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Laplace, however, following up his own problem, puts y= 1 — « and says that 


lq +Pr q(1 yp 
wr 7 & — 2; 
1 \P 





is the chance that, if the chance of a birth in a year be , the ratio of births to 
population will be q’/p,’.. Iam quite unable to follow this argument; it appears 
to me that the great master by adopting an urn problem has lost touch with 
reality. I fail to see how births and numbers in a population correspond to black 
and white balls in definite proportions in an urn, so that if on “trial” (trial of 
what ?) a birth does not occur, then another unit of population must occur. It 
sounds almost a self-contradiction, whether we are dealing with a population 
which includes or excludes births. An increase of births seems to be correlated 
with an increase of population and not a decrease, as Laplace’s urn problem 
suggests. Yet there is a very real problem behind Laplace’s analysis and it is 
easy to see cases to which it might apply, but an essential condition of them 
is that the two variates should be erclusive*, and this appears to be far from the 
truth in births and units of population. 
Laplace, having reached the expression 
) gq +p 
q’ |p 


for the chance of births q’ and population p,’, now puts p,’=0, 1, 2... 9, and 


a? (l1—a)” 


sums for these values, i.e. he assumes all values of p,’ from 0 to © equally likely 


initially, or “he distributes equally his ignorance.” It is easy to see that the sum 
=a7(1—(l—a2)-We= 
Hence, applying inverse probabilities, 


q . al “*(1—2yp 
7 |Ps 

is the chance of g’ being associated with p,’.. Again I think Laplace is at fault, 
I do not see how it is possible to give p,’ this system of values in his actual 
problem, because we know from our sample that there is at least a population 

of p, for we have sampled that number, or if we subtract births p—q. On Laplace’s 
assumption that we are taking a second sample from an infinite population, this 
might be true. Again it is difficult to follow how a zero population could be 
associated with q’ births! 

' Returning to Laplace, he now says that the probability of the ratio « occurring 

as judged by the first sample is (by the law of inverse probabilities) 

x7 ( l—«a2)” / BSS | a1(1—a2)’idzx= ai(1 -- aye /| ; w1(1—a)” dx. 

\P. |9 / Pr \7 /0 1 Je 


rt 





) 


For example, if every person vaccinated were registered, then Laplace’s problem would apply to 
i determining the total population from the total registered vaccinated on the basis of counting vaccinated 
and unvaccinated in a sample. 
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Thus he reaches the following expression for the combined probability of 
q and p, occurring with the value a: 





Pr + q' atta'+1(] — @)Prter 
ln’ la 2 br ene i 
(Pa lv [ ay qd one wx) dx 
“0 
The next step taken is to sum this for all values of p,’ from 0 to some integer 


s, and then to integrate for « from 0 to 1, and we thus reach the following 
expression for the chance of p,’ lying between 0 and s: 


1 . P > 
| gat? (l—«a)P (1 + (q’ +1)(l—«#)+...4+ q +1)..-@ + 8) (l— zy) dx 
s 


; ' 
p= 





< 
| ai(1—a)ida 
nr Ue Se ee (xxiv). 
Laplace now says that when q’ and s are large 
(7 +1)(q' +2) 
2! 


1+(q¢+1)-2)+ (l—a) +... 





‘ (q+ =D: (q' +3) (1-2) 


J1—z 


pl 


Poi +1 “1 . 
S Je / , 
| a’s(1 — 2’) da 
( 


This can easily be shown to be the case whatever gq’ and s may be, whether 
large or not. Hence 
1 1 ; 
| a1 (l1—a)ra [ a’s(1—2’)t da’ dx 
(i J1-2 


Le cee nee (xxv). 


"es l 
| 24(1— 2)" da x | a*(1 — a’)? da’ 


Laplace then appeals to a previous Mémoire in which he has treated the 
evaluation of the integral in the numerator when q, p,, s and q are large 
numbers*, This really amounts to replacing the integral of the series for P 
by the integral of the best fitting normal curve. I think we can get a more 
exact result, and deduce Laplace’s value from it, by regarding the numerator of P 
in (xxiv) as a series of complete B-functions : 


pages +t BG@t+7¢ +2 mttt+) 
t=0 \q’ \t B(q+ 1, p,+1) 
_|atg+1 lq+pitl 
dat +p. +2 








Fe +1,p.4+1,9+ 94 +p.+3), 


where F, denotes the sum of the first s+1 terms of a hypergeometrical series. 
This series is convergent because y—a—8—1=q. Hence by using the formulae 
of the paper to which this is appended we can get the sum of any number of 
terms by fitting with the appropriate skew curve and finding its area. 


* “Sur les approximations des Formules qui sont fonctions de trés-grands Nombres.” Mémoires 
de V Académie des Sciences, 1783, p. 430. Paris, 1786. 
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We have 
Mean -__ 28 -  -@+ D(a +1)_ op 
y-a-B-1 q “ 
if as Laplace supposes we are dealing with large numbers. Again 
gt AtaAt8)_ (9 +VD(mtDGt7+DG+mtY 
V(A-1) g(q-1) 
_TAG+YGtp) 
g ° 
if the numbers be large. Again, 








, , 
Mode = “P: _¢P: _ Mean, 
q+2 q 

if the numbers be large. Further, 8,» 0 and 8, 3, if the same condition holds. 

Accordingly the hypergeometrical series approaches the normal distribution, 
and this is what Laplace’s method really amounts to. He takes the distribution 
of p, to be a normal curve about the value p,q’/qg as mean with the standard 
deviation 





a= / tras 1 G+ w) 
q° 
This is in agreement with Laplace’s solution in the Mémoires de ? Académie, if 
we put p,=the population, and in accordance with the Théorie des Probabilités 
if we put p,=p—gq, i.e. population minus births in the sample. See p. 393 of the 
1812 edition. Thus the problem is reduced to a determination of what the size 
of p, should be in order that 


dak behiagan eeeieeeen pen (xxvi). 


a — q 
(=n) 
should provide a probability, as given by the Tables of the Probability Integral, 
which will be less than a fixed amount. Laplace takes the case of an error 
of half a million in the estimated population either in excess or defect to have 
odds of 1000 to 1 against it. Accordingly, 500,000/c must give a tail area of 
000499, or (1 +a) must in the usual notation be about 9995. This connotes 
o = 500,000/3'29 nearly, and with the above value of o 


500,000\?__, p, Dr f) 
( 3-29 ) ~ 4g (1+ = ae 


Laplace puts q' = 973,054'5 as the annual number of births in France*, He 
assumes on the basis of previous samplings that p,/q is in the neighbourhood 
of 26; he actually takes 25:5, 26 and 26:5. If we adopt 26, we have 


500,000\2 q’ 
wT) = 973,054 x 26 x 2 
( =o ) 973,054 x 26 x 7(1+4), 
whence I find q =32°8124q, 
and pi = 26q = 26 x 973,054/32°8124 = 771,082. 


* Note the -5, very characteristic of Laplace, especially when his whole process of representing the 
hypergeometrical series is only a relatively rough approximation.. 
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Laplace gives p, = 771,469, sensibly in agreement, Thus according to him the 
relation of births to population ought to be determined on a sample of 700,000 to 
800,000, if the odds are to be 1000 to 1 that we have not an error of 500,000 in 
the total estimated French population. 


Adopting the reasoning of the Mémoire, in which p, and p,’ represent popu- 
lations and gq and q’ births, I can see no justification for Laplace’s method of 
reducing the problem to an urn problem. I see no reason why an additional 
birth in the sample means one fewer member of the population. I see further no 
ground whatever for considering the first sample and France as a whole as 
independent samples from an indefinitely large population. If we take p,=p—q 
as is done in the Théorie, then I think the reasoning must be of this kind: the 
population consists of “new-borns” (as indicated by the births) and “old-borns,” 
and when we pick a sample of a definite size a new-born may be considered to 
exclude an old-born and vice versa. But although there is a reason here—and 
there seemed none in the previous case—I am doubtful of its validity. It assumes 
that all correlation between new-born and old-born is due to sampling errors, and 
that there is no organic relation between the two; that is to say that we suppose 
in any sample that the number of new-borns is independent of the constitution 
of the population of old-borns, for example of their age distribution, which would 
vary from sample to sample, even if the samples were truly random. 


Whether Laplace’s Problem can or cannot be applied to the new-born con- 
sidered as marked individuals in a population, it appears desirable to obtain 
a solution of it and compare it with Laplace’s. I state it as follows: 


Lapicce’sPechlem. A population of unknown size NV is known to contain q’ 
affected or marked members. Zé is 4gsired to ascertain—on the hypothesis of 
inverse probabilities—a measure of the error introduced by estimating N to be 
nx # , where g is the number of marked individuals in a sample of size n. 

q 

It is clear that the problem is a very important one and it occurs repeatedly 
when arguments from samples are applied to populations. If p’ and p be the 
number of unaffected in the population V and its sample n, then by Equation (in) 
above the chance C,,» of a sample from N containing p unaffected individuals is 

| |p’ \q' N—n 


lap ie -plai-q WV 





rebates sicexses eens 


Here n, p, q and q’ are really known and p’ = N — q’, hence 
N-q|\N-n 
x 


- lq Pp f 2 |V _ q =—_ NV j 


|\n lq’ 





C 


/p,p 


Now by the rule of inverse probabilities, the chance of the sampled population 
being N is proportional to ¢ 


‘yp’ = Cy,v'|Spy (Epp). But since a sample n has been 
tested and found to contain p unmarked individuals and N is known to contain q’ 
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e 
marked individuals, V can only vary from p+ q’ to 0. Hence the frequencies of 
the various possible values of V are proportional to the terms of the series 


14 (PAD —at)) , (pt+)) (p42) -7+ DG -9+ 2) 





l!(p+q +1) 2!(ptqt+l)(p+q +2) + etc. .... 
This is a hypergeometrical series of which the sum is 
lgq—2 |p+q 
lq —1jn—1° 


Thus the distribution of V per mille of possible values of NV is 


1000 |q°—1 |n—1 (1 (p+1)(q'-q4 1) (pt+1)(p + 2)(q —q+1)(q' —q+2) 
lq—2 |jp+q L!(p+q+4+1) 2!(p+q+1)(p+q +2) 


+...4 ol rrr esi 
Here the first term corresponds to V = p + q’. 


This hypergeometrical can as before be very closely replaced by a Pearson 
Curve of which the 8, and 8, are given in (xviii) and (xxi) above. If we content 
ourselves like Laplace by replacing the hypergeometrical series by a normal 
curve we have 

" (9 +1)(d —q+1) 
Mean of VN = NV =p -|- 7 + l 1 ; 1 ; 
q—-= 


Mode of NV = N =pt q + 


> 


p(d-9) q 
Si 

q q 

> (PtHD YG -—9+)D@m-1I)q7-1) 


on = ree (xxix). 
N Cee 2)? (¢—3) 
When p, ¢ and q’ are all large numbers, Laplace a ese, then 
<4 tA ad : 
oe ee N aheee a =a N 
qd 
or mean and mode coincide, while 
. TP —D (p+ QM 

oy = 1P\4 1)\t Rect eg) eee (xxx). 


qq 
This is clearly not in agreement with Laplace’s result above (xxvi), although 
ise Théorie which makes p, the difference of the size of the population and the 
uumber of births for the sample is more correct than the Mémotre*. The chief 


Laplace’s p should in our notation I think be p+q, although this is then not wholly in accordance 
. . ¢ F i p , , , . , - 
with his mean, which he takes as p t —our p a4 q', =p’'+q', so that his p’ corresponds with the whole 
q q 
population not new-born. I say not wholly in accord, because when he comes to actual numbers he 
puts p=2,037,615, his enumerated sample population, and not this number less the births, With this 
value for his p, Laplace’s standard deviation is given by (p. 393 of the Théorie) 
2_ (p+) ap (¢+q’) 
e=———_,——_. 
e 

On the basis of his larger sample, Laplace makes the ratio of population to births to be 28°352,845 (!), 
an increase on his former 26. 
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difference however lies in the factor q’—gq, where Laplace’s result has q’ +4. 
This difference is produced by Laplace treating his sampled population as if it 
were a second sample independent of that already taken. This it is not, and the 
two values will only approach when the size of the sample is very small as 
compared to that of the sampled population. In most such cases oy will be too 
large to render a determination of NV trustworthy. 


I venture to think, therefore, that while Laplace’s Problem is most important, 


it does not cover the case to which he applies it, and that his solution of the 
problem itself is not really correct. 





} 
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I. INTRODUCTORY. 


ONE of the most common as well as most important problems which arise in 
the interpretation of statistical results, is that of deciding whether or not a 
particular sample may be judged as likely to have been randomly drawn from a 
certain population, whose form may be either completely or only partially specified. 
We png A term Hypothesis A the hypothesis that the population from which the 
sample = has been randomly drawn is that specified, namely Il. In general the 
method of procedure is to apply certain tests or criteria, the results of which will 
enable the investigator to decide with a greater or less degree of confidence 
whether to accept or reject Hypothesis A, or, as is often the case, will show him 
that further data are required before a decision can be reached. At first sight 
the problem may be thought to be a simple one, but upon fuller examination one 
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is forced to the conclusion that in many cases there is probably no single “best” 
method of solution. The sum total of the reasons which will weigh with the 
investigator in accepting or rejecting the hypothesis can very rarely be expressed 
in numerical terms. All that is possible for him is to balance the results of a 
mathematical summary, formed upon certain assumptions, against other less precise 
impressions based upon @ priori or @ posteriori considerations. The tests them- 
selves give no final verdict, but as tools help the worker who is using them 
to form his final decision; one man may prefer to use one method, a second 
another, and yet in the long run there may be little to choose between the value 
of their conclusions. What is of chief importance in order that a sound judgment 
may be formed is that the method adopted, its scope and its limitations, should 
be clearly understood, and it is because we believe this often not to be the case 
that it has seemed worth while to us to discuss the principles involved in some 


detail and to illustrate their application to certain important sampling tests. 


There are two distinct methods of approach, one to start from the population 
II, and to ask what is the probability that a sample such as ¥ should have been 
drawn from it, and the other the inverse method of starting from = and seeking 
the probability that I] is the population sampled. The first is the more customary 
method of approach, partly because it seems natural to take II as the point of 
departure since in practice there are often strong @ priort grounds for believing 
that this is the population sampled, and partly because there is a common 
tendency to view with suspicion any method involving the use of inverse prob- 
ability. But in fact, however strong may be the & priori evidence in favour of II, 
there would be no problem at all to answer if we were not prepared to consider 
the possibility of alternative hypotheses as to the population sampled; and we 
shall find that it is impossible to follow the first method very far without intro- 
ducing certain ideas of inverse probability—that is to say, arguing from the 
sample to the population. If on the other hand we start boldly with assumptions 
regarding & priort and & posteriori probability, we reach by an almost simpler 
method sampling tests very nearly equivalent to those obtained from the first 
starting-point. Indeed the inverse method may be considered by some the more 
logical of the two; we shall consider first however the other solution. 

Perhaps the most suggestive method of description is to represent = by a 
point in a hyperspace whose dimensions will depend upon the particular problem 
considered ; and to associate the criteria for acceptance or rejection with a system 
of contours in this space, so chosen that in moving out from contour to contour 
Hypothesis A becomes less and less probable *. The frequency with which the 
sample corresponding to a particular point will occur in random sampling from TI 
may be represented by giving to the space an appropriate “point-density.” Thus 
the chance of drawing a sample whose representative point lies within a certain 


* Here and later the term “probability” used in connection with Hypothesis A must be taken in a 
very wide sense, It cannot necessarily be described by a single numerical measure of inverse probability ; 
as the hypothesis becomes ‘‘less probable,”’ our confidence in it decreases, and the reason for this lies in 
the meaning of the particular contour system that has been chosen. 








> 
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region in the space is proportional to the density integrated throughout that 
region, or in other words to the “weight” of the region. The contours are not 
necessarily contours of equal density, but may be surfaces or regions throughout 
which some statistical measure such as the mean or standard deviation remains 
at a constant “level.” Although it is impossible to visualise a space of high 
dimensions, it will be found that with the help of the analogy of a three-dimensional 
density space this method of description is often a very considerable aid to purely 
algebraic discussion. 

Setting aside the possibility that the sampling has not been random or that 
the population has changed during its course *, © must either have been drawn 
randomly from II or from II’, where the latter is some other population which 
may have any one of an infinite variety of forms differing only slightly or very 
greatly from Hl. The nature of the problem is such that it is impossible to find 
criteria which will distinguish exactly between these alternatives, and whatever 
method we adopt two sources of error must arise : 

(1) Sometimes, when Hypothesis A is rejected, = will in fact have been drawn 
from IT. 

(2) More often, in accepting Hypothesis A, = will really have been drawn 
from IT’. 

In the long run of statistical experience the frequency of the first source of 
error (or in a single instance its probability) can be controlled by choosing as a 
discriminating contour, one outside which the frequency of occurrence of samples 
from II is very small—say, 5 in 100 or 5 in 1000. In the density space such a 
contour will include almost the whole weight of the field. Clearly there will be 
an infinite variety of systems from which it is possible to choose a contour satis- 
fying such a condition. For example there will be the system of contours upon 
any one of which (a) the mean, or (b) the standard deviation, or (c) the ratio of 
mean to standard deviation, of © is constant. 


The second source of error is more difficult to control, but if wrong judgments 
cannot be avoided, their seriousness will at any rate be diminished if on the 
whole Hypothesis A is wrongly accepted only in cases where the true sampled 
population, Il’, differs but slightly from TI. It is not of course possible to deter- 
mine II’, but making use of some clearly defined conception of probability we 
may determine a “probable” or “likely” form of it, and hence fix the contours so 
that in moving “inwards” across them the difference between IT and the popu- 
lation from which it is “most likely” that } has been sampled should become 
less and less. This choice also implies that on moving “outwards” across the 
contours, other hypotheses as to the population sampled become more and more 
likely than Hypothesis A. 


* A non-random or biased sample from IT may appear to be a random sample from II’; in practice 
it may be very important to be able to attribute = to IT rather than II’. But clearly no general rules to 
cover such cases can be given, for the position must depend in each case on the possible forms of bias 
that may have occurred. 


Biometrika xxA 
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Both these aspects of the problem must be taken into account. Using only 
the first control, any given sample could always be found to lie outside an ex- 
tremely divergent contour of some system. That is to say a criterion of any 
desired degree of stringency could always be found for a sample. But regarding 
the position from the second point of view, it is seen that there will only be certain 
systems which are of any value as criteria. 


The application of these principles will become clearer when illustrated in the 
cases of particular sampling tests, but it seems well to emphasise at the outset 
the importance of careful thinking in these matters, For example it might 
readily be supposed that if, in sampling from II, a sample of form %, occurs more 
frequently than one of form ,, then greater confidence in Hypothesis A would be 
justified if =, were drawn rather than =,. But we shall see that under certain 
conditions this may not be the case, because in the first event alternative 
hypotheses are relatively far more probable than A than they are in the 
second. It is indeed obvious, upon a little consideration, that the mere fact that 
a particular sample may be expected to occur very rarely in sampling from II 
would not in itself justify the rejection of the hypothesis that it had been so 
drawn, if there were no other more probable hypotheses conceivable. 


II. SAMPLING FROM A NORMAL POPULATION. 
(1) Description of the Fundamental Space. 


We shall discuss first the case of sampling from an “infinite” population in 
which the single variable under consideration follows a Normal Distribution, and 
we shall suppose that this population, I], is completely specified, its mean a and 
standard deviation o being known. We have a sample, =, of n observations, 
X,, X2,...X,, and the hypothesis whose probability we wish to test is that = 
is a random sample from I1; this is Hypothesis A. The methods to be discussed 
are perfectly general, but we have particularly in mind the case of small samples 
where the data are insufficient for the application of the ordinary (P, x’) test for 
goodness of fit. 


Making use of the geometrical method first introduced into this problem by 
R. A. Fisher*, we shall imagine an n-dimensioned space in which we take an 
origin at the point O (a, a,... a) and rectangular axes Oz,, Ow,, ... Ox». Referred 
to these axes, is the point a,=X,—4, a, = X,—-—4a,...a,=X,—a. We now fill 


) 


this space with the density field appropriate for samples of n drawn from II, 
making the point-density at (a, %, ... @,) equal 
1 intee? 2) 
. —~ aa (2° +2 +... $2y : 
D=1/(V2rc)".e 28°" .  ebaieigutalenien cis (i), 
so that the chance of drawing a sample with variates lying within the range 
&, + $da,, ... t, + $dax, is Ddz,,... day. 


* Biometrika, Vol. x. p. 507. 
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In the great majority of problems we are neither able nor do we wish to 
distinguish between ,, #,...%, as differentiated variables, but in the space as 
defined there will be x! points corresponding to a given unordered set of » values 
of the variable. Each of these points will lie in a different region of the general 
hyperspace ; one such region will be that for which 

Wee Wires cS ig can ccwenwsinaeescaseannennueeres ai), 
which may be described as a wedge-shaped region lying between n—1 primes 
passing through the line 


These n! regions are identical in shape and density distribution and one may 
be superposed upon another or all combined together by a rotation about the 
axis (iii)*. The contour surfaces of equal density and the other contours that 
will be considered are all figures of revolution about this axis, and it will follow 
that any conclusions which may be drawn regarding the integral of the density 
taken throughout regions lying between such contours will apply to a single 


D 





O 








Fic. | 

segmental region as well as to the complete space. For simplicity in treatment 
we shall therefore consider the latter space, which may be termed the fundamental 
space, bearing in mind that in practice it is generally only possible to locate = in 
a single region such as that defined by the conditions (ii). This region may be 
supposed filled with a density obtained by superposing the »! similar regions. 

Denote the mean of = by a + m and its standard deviation by s. The section 
of the fundamental space by the two-dimensioned plane which passes through the 
line (iii) and the point P representing = is shown in Figure 1. O is the origin, 
OD the line (iii), and # the point where this line is cut by 

Ba oc ptt AME sco ncceanssansdeanonasecenees (iv). 

* In the case n=2, the regions are the halves of the x, x, plane divided by the line #,=2,. For n=3, 

there are six regions, each being a “ wedge” or ‘‘slice” in three-dimensioned space lying between two 


planes which cut at an angle of 60° in the line z,=2,=2x3. We shall use the term “ prime” to describe 





any linear function of the n variables. Such a locus is a “flat” space of n—1 dimensions lying in 
n dimensions. 


12—2 
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The prime (iv) passes through P and lies at right angles to OD, so that the 
angle PEO is a right angle. Further P is the point (X,—a, X,—a,...Xn—«@) 
and £ the point (m, m, ... m), so that 


OP*= 8S (X,-—a)=n(m'+s°), 
t=1 


OF? = nm? and consequently LP? = ns*. 

Hence if we reduce the scale of this section of the hyperspace in the linear 

ratio of 1/Vn, we shall obtain a plane in which the position of the point corre- 

sponding to P is given by rectangular coordinates (m, s). This plane will be 

termed the (m, s)-plane. Any contour in the hyperspace, whose equation can be : 
expressed in terms of the two variables m and s only, may therefore be obtained 

by a rotation of the corresponding curve in the (m, s)-plane about the axis (iii), 

° ‘- ° J 
and a linear enlargement in the ratio Vn: 1. 
Consider now the following : 


(a) The point density D in the hyperspace may be written 





n (m? + s*) 
= _4y2 
D=constant x e 20 = constant x e~ 2X, 
nm 
where* yor = S (X,—a) = n(m? +8*). 
t=1 
(b) The contours of equal density are (n — 1)-fold hyperspheres centred at the ) 


origin O. 

(c) If the two-dimensioned (m, s)-plane be rotated in m dimensions about the 
axis (ili), the point P will trace out an (yn —2)-fold hypersphere lying in a 
prime perpendicular to the axis of rotation ; the boundary surface of this hyper- 
sphere is proportional to s"~*. 


Consequently the integral of D taken between any contour surfaces in the 
fundamental space which can be obtained (after enlargement) by a rotation of 
curves lying in the (m, s)-plane, will be the same as the integral of d between the 
corresponding curves in that plane, where d is a point-density assigned to the 
plane measured by 

n (m* +- 8”) 


d = constant x s”— e 20" 


or choosing the constant so that 


+a ce 

| | d.dm.ds=1, 
—o 0 
n 








2 oe n (m? + s*) 
] n o> = 32 
— ———_ al 7 
a= —s =o CS es ewceeseesteees (v). 
es n—1) 
Vr2? 7 ( x) 


* This x? must not be confused with the x? of the Goodness of Fit Test. 
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The equations of the five contour systems defined below can all be expressed 
in terms of m and s. In order to grasp their bearing on the problem of testing 
Hypothesis A, it is necessary to consider their form and position in the funda- 
mental space, but their reduction to the (m, s)-plane makes clearer their relation to 
one another and enables a ready integration of D to be obtained in certain regions 
of the hyperspace. 


These contours, which are shown diagrammatically in the (m, s)-plane in 
Figure 2, are as follows: 


(1) The contours of equal density D (contours of y). These are semicircles, 
centred at O, of radius yo/Vn = Vm? + 8°. 






$- cortour 

















D’ O< m. > 





Fic.2 


(2) The contours of constant m; a series of lines parallel to the axis of s. 
(3) The contours of constant s; a series of lines parallel to the axis of m. 


‘4) The contours upon each of which the ratio of z= m/s is constant (contours 
i 
of z). These are a series of straight lines radiating from O. 
o 


(5) The contours of equal density d, or of equi-probable doublets (m, s). They 
form a series of oval curves in the (m, s)-plane with equations 


(n — 2) log, s — 4m (m? + 8°)/o27= K cece seceeseceeseeeees (vi), 


; x : ° / é 
and surround the point of maximum density G, or m=0, s = ao V(n — 2)/n. 
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By integrating d outside the first four contours in the (m, s)-plane we obtain 
the four standard integrals for the chance, P, of drawing a sample with x, m, s, 
or z, equal to or greater than the value observed : 


on 
(1) Prey] yt te W dy cssssssesssssseeseerees (vii), 
Ix 
: nm? 
(2) Pyu=Ce| € oj RE Oe Oe (viii) 
m 
(3) The distribution of s being skew, we may take 
~ ns 
P.=¢ | Pb Pe eos cricson (ix), 
8 
- a ns 
or P/= sf ste 2 ds........ es EAT PES te MT (x), 
0 
according as s is greater or less than the modal value s, = oV(n— 2)/n. 
00 n 
(4) P,= 0 | CE Ee PRE: ccccamicets ceteris (xi), 
and finally, for the chance of obtaining a less probable doublet (m, s), 
_B(mi +s") 
(5) Pus ete ™ dmde ...... siahiataieae (xii), 
Ja 


where 


is to be taken outside the contour of system (vi), which passes through 
J/a 


the sample point. This integral has not so far been computed. 


The form of the contours in the fundamental space obtained by rotating 
Figure 2 about D’OD in n dimensions can be easily visualised in the case of 
n=3. The y-contour becomes a sphere, the m-contour a plane at right angles 
to the axis, the s-contour a cylinder, the z-contour a cone, and the (m, s)-contour a 
hollow ring. 

The foregoing results are not of course new, but this restatement has seemed 
necessary in order to make clear the discussion which follows*. 


(2) The Use of the Contour Systems in testing Hypothesis A. 


In the first place it is clear that any of the five contour systems described 
above will enable us to control the first source of error referred to in the intro- 
duciory section. By rejecting Hypothesis A only when the point representing = 
falls upon a contour for which P (whether Px, Py», ...) is, say, <°01, we shall be 
certain in the long run, in random sampling, of only rejecting the hypothesis 
when > has in fact been drawn from II, in 1 case out of 100. But the samples 
rejected and those included will vary according to the system chosen. Consider 
the systems individually. 


* See, for example, Student, Biometrika, Vol. v1 
p. 507, 1914; Metron, Vol. v. No. 3, p. 3, 1925. 


> 


. p. 1, 1908; R. A. Fisher, Biometrika, Vol. x. 
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(2), (3) and (4), contours of m, s and 2, 

If we wish to judge whether a certain deviation in the mean, or a certain 
standard deviation, or a ratio of the two, is likely to be found in sampling from TI, 
these systems are entirely satisfactory. For instance, the smaller be P,,, the 
more divergent is the sample mean at a+m from the population mean at a, the 
less frequently will a sample with mean in the limits a +m + 4dm be drawn in 


random sampling, and other things being equal the more likely become alternative 
hypotheses as to the value of the population mean. 

But if our purpose is to make full use of the information supplied by =, and 
to ask whether, taken as a whole, it is likely to have been drawn from II, the 
contours of m, s, and z are inadequate. In the first place they are not closed 
contours, for the space bounded by any one of them stretches away to infinity ; 
this is true of the fundamental space as well as of the (m, s)-plane. Consequently 
the space on the inner side of any contour, however near to unity be the P 
associated with it, will contain points representing samples which are infinitely 
divergent from the population type. Therefore in using one of these systems alone 
as a criterion we should sometimes find ourselves being led to accept samples for 
which our common sense tells us that Hypothesis A is extraordinarily improbable. 

It will also be seen that, while in the fundamental space the contours of the 
criteria of (viii) and (ix) for m and s contain the central region of the density 
field, the z-contours are orthogonal to the surfaces of equal density, and select 
out for rejection in a quite arbitrary fashion, the sample points lying at right 
angles to the axis (iii)*. 

(1), contours of x fF. 

These contours alone have the property of coinciding in the fundamental 
space with the surfaces of equal density, so that any contour divides off samples 
of more from those of less frequent occurrence. P x will provide a satisfactory 
measure in any problem where the judgment of the statistician is aided by a 
scale which expresses in numerical terms the fact that, in drawing from a given 
population, a sample is of relatively common, rather rare, exceptionally rare occur- 
rence, etc. But in the great majority of problems we cannot so isolate the relation 
of = to IT; we reject Hypothesis A not merely because = is of rare occurrence, 
but because there are other hypotheses as to the origin of } which it seems more 
reasonable to accept. This attitude may be illustrated by the following example. 

Suppose that IT is “normal” with mean at the origin of reference and ¢ = 1000, 
and that = is a sample of 8 for which the variate values are as follows : 

+10, +04, —0°6, +01, —1:1, —03, —O1, +0°2. 

* This criticism of the z-contours does not of course apply to their use in Student’s Problem, where 
the standard deviation of II is unknown, which is, discussed below. In the case where o is known, how- 
ever, they provide a good illustration of a system which gives for any sample a measure FP. 
completely useless as a criterion to apply in testing Hypothesis A. 

+ The x-contours here considered are of course quite different from those occurring in the (P, x’) test 
for goodness of fit, or what may be termed the “y? group test.” Here the variables are x,, x, ... & 


there they are the group frequencies nj, no, ... ny. 


5» Which is 


n? 
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Here x? = ‘Ss (X,—a)/o? = 000288. 
t=1 
Px is therefore very close to unity, the sample point lying almost at the centre 
of the density field. Yet unless the investigator had very strong & priori grounds 
for supposing that the standard deviation of the sampled population was 100°0, he 
would instinctively reject Hypothesis A, feeling that some other hypothesis, that 
gave o in the neighbourhood of 0°5, was far more probable. This would appear to 
be a decision of common sense; we shall seek later to express if in more exact 
terms. 


(5), the (m, s)-contours or contours of constant d. 


These, just as the x-contours, have the advantage of being closed, and if we 
take as the criterion of rejection P,,,,< ‘01, let us say, we shall not accept samples 
for which Hypothesis A seems exceedingly improbable on any common-sense 
grounds. As only the most divergent of the contours approach the origin O, we 
should reject Hypothesis A in the case of the numerical example given above. 
Further, since in sampling from a Normal population the mean and standard 
deviation may be considered as the two most important descriptive constants, 
the (m, s)-plane has a certain fundamental importance and it seems rational to 
adopt the contours of equal density in that plane. At any rate in making use of 
them we are applying what may be termed a “two constant.” test, as distinct 
from the less adequate “ one constant” test of either m or s alone*. But it must 
be remembered that it is difficult to find any logical reason for accepting the 
contours of equi-probable doublets (m, s), rather than those of the doublets (m, s*), 
or (m, s*), ete. For a given sample P»,,: will differ from P,,,,; the contours of the 
former in the fundamental space are obtained by rotating the oval curves in the 
(mM, s)-plane for which 

(n — 3) log, s — 3m (m? + 8*)\/o2 = Ko... ccccc eee eeeeees (xiii), 


an equation differing from (vi) in the factor n —3 instead of n — 2 


We must in fact consider whether it is possible to find a contour system which 
will take into account the probability of alternative hypotheses, and will not 


depend for its validity on the particular statistical constants chosen to describe 
the sample. 


(3) The Criterion of Likelihood. 


Suppose there to be two hypotheses regarding the population from which a 
given sample has been drawn; there will be one density field for Hypothesis A 
corresponding to I, and another for Hypothesis A’ corresponding to II’. By 
surrounding the sample point in the fundamental space with an element of volume 
dX,, dX,,...dX, we have in the ratio of the “weights” of the elements a measure 
of the relative frequency of occurrence, on the two hypotheses, of samples lying 


* It will be seen that if both the m-test and the s-test are applied to a sample, and the Hypothesis A 
is rejected if, let us say, |m| > 3c,, and |s—o| 


- 3o¢,, the criterion of rejection will form a rectangle in 
the (m, s)-plane corresponding very roughly to the oval of the (m, s)-contours. 
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within the prescribed limits. These elements may seem a little artificial since we 
know in fact that the n variates of the sample have each unique values, even if 
these cannot be determined exactly. It will therefore be well to consider the 
position more carefully. 


The possible range of the variable «= X —a@ may be broken up into small 
subranges of equal length h. The n variates of the sample will each fall into one of 
these subranges, and the corresponding point P in the fundamental space will fall 
into one of the n-dimensioned hypercubes of side h which make up the framework 
into which the space will now be divided. As pointed out above, on p. 179, for 
a given set of unordered values #,, #, ... #, there will be a large number of hyper- 
cubes in the complete space into which P may fall. If », of the variates fall into 
one subrange, n, into another and so on, where n,-+ m +...+2=n, then the 
chance of drawing a sample with its representative point in one of these cubes is 


C.——— DS Rent sabe seabemuseen ee een (xiv) 
. n,! No! ... met! 


where p is the “ weight” of one of the cubes. If none of the variates falls into the 
same subrange this expression reduces to n! p. 


If we confine our attention to a single one of the n! segmental regions such as 
that defined by (11), we may consider the form of (xiv) as due to a boundary 
effect. When several of the a’s fall into a single subrange, only a portion of the 
corresponding hypercube or 1/(n,! n.!... mg!) lies within the region (ii), part being 
cut off by one or more of the n—1 primes bounding the region. 

As h/o is decreased so as to make the block surrounding P a differential 
element of volume, this boundary effect becomes negligible, none of the n variates 
falling into the same differential subrange. On the other hand as h/o is increased 
the effect becomes more and more important, until all the hypercubes within the 
field of significant density cut a considerable number of the bounding primes. In 
the first case we measure the chance of drawing a sample having the n variates in 
the range 

a, + th, ... a4 th, 
by FS (Ga), F (2)y +++ fF (Gen) A®....- 200000 fieen dynaiickedoeays (xv), 
where for the Normal population 


2 0° 


f(a) =1/V200.e ~*; 
in the second case, by a term of the multinomial expansion, namely 


= SY sic EE acpi ekcanencne ven neniesente (xvi) 
n,! nq! ... mp! 
asthh : 
where D, = J («) da. 
+ X- th 


Now it can be shown that when h is a finite subrange, a comparison of the 
probability of two hypotheses obtained from two .expressions such as (xvi) leads 
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directly to the ordinary (P, x*) test for goodness of fit*. It would seem therefore 
that by supposing that each observation lies somewhere within a differential 
element of fixed length, and comparing expressions of form (xv), we employ a 
method which differs only in degree from that used in cases of broad grouping. 
The use of a grouping unit of fixed length+ whatever be the hypothetical popu- 
lation is common to both methods; the difference consists mainly in the fact 
that in one case the grouping unit is so large that many variates will generally 
fall into each group, while in the other the great majority of groups are vacant, 
and those filled contain only one observation. 

We shall therefore take the ratio of two expressions such as (xv), with a 
common /, that is to say the ratio of the point-densities at P in the two fields, as 
a measure for the comparison of Hypotheses A and A’. Probability is a ratio of 
frequencies and this relative measure cannot be termed the ratio of the probabilities 
of the hypotheses, unless we speak of probability & posteriori and postulate some 
& priort frequency distribution of sampled populations. Fisher has therefore 
introduced the term likelihood, and calls this comparative measure the ratio of 
the likelihoods of the two hypotheses}. For a given sample and two given 
hypotheses, the ratio of the likelihoods is completely independent of the co- 
ordinate space in which the sample point is represented. For example the ratio 
of the densities of the two fields at P is the same in the space Of 2, %_,...% a8 In 
that of a,*, #,*,...«,°. This is a point of considerable importance. 

As an illustration, return to the example of p. 183. We may suppose that 
there are two hypotheses as to the population: (A,) Mean at 0, ¢ = 100-0, and 
(A.) Mean at O,o = 05, Then the two corresponding densities at the sample point 
are (A,) 6°42 x 10-*, (A.) 5°18 x 10~ and the ratio of the likelihood of Hypothesis 
A, to that of A, is 1°24 x 10~-". This ratio confirms the common-sense judgment 
that A, is far more plausible than A,. 


There is little doubt that the criterion of likelihood is one which will assist the 
investigator in reaching his final judgment; the greater be the likelihood of some 
alternative Hypothesis A’ (not ruled out by other considerations), compared with 
that of A, the greater will become his hesitation in accepting the latter. It is 
true that in practice when asking whether = can have come from II, we have 
usually certain @ priort grounds for believing that this may be true, or if not 
so, for expecting that II’ differs from II in certain directions only. But such 
expectations can rarely be expressed in numerical terms. The statistician can 
balance the numerical verdict of likelihood against the vaguer expectations de- 
rived from & priort considerations, and it must be left to his judgment to decide 

* This point will be discussed later in connection with the (P, x?) test. R. A. Fisher has dealt with 
it in considering the relation between Maximum Likelihood and Minimum x?, Phil. Trans. A, Vol. 222, 
p. 357; Journ. of Roy. Stat. Soc, Vol. uxxxvii. p. 446. 

t It might be supposed that the element should be made to depend upon the parameters of the hypo- 
thetical population. But if, for example, k were taken to be proportional to c, it is seen that the Normal 
population, for which the chance of drawing a sample with observations in the prescribed limits is a 
maximum, will have c=. Such an interpretation of the element would be clearly of little value. 

t Phil. Trans. A, Vol. 222, p. 326. 
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at what point the evidence in favour of alternative hypotheses becomes so con- 
vincing that Hypothesis A must be rejected. 

As for all criteria, there is a limitation to its value in the case of very small 
samples. If all hypotheses were equally probable & priori, the criterion provides 
a means of picking out the most likely ones & posteriori, whatever be the size of n. 
But if we have as is usual some @ priori grounds for believing in A, we shall 
not feel disposed to give up this belief unless the alternative suggested by the 
method of likelihood is what may be termed a stable alternative. For example, in 
the previous illustration, while it seemed reasonable to reject Hypothesis A upon 
the evidence of a sample of 8, we should not do so if we had only a sample of 2 
with values + 0°4 and — 0°6, because we should be quite ready to find on adding a 
third observation to the sample that it had a value of +50°0 or — 100°, or of 
almost any quantity whatsoever. Here the alternative of a population with o =05 
does not appear sufficiently stable to be very seriously considered. 

(4) The Application of the Criterion of Likelihood in testing Hypothesis A. 

We shall suppose that while there is uncertainty as to the values of the mean 
and standard deviation, a and o of the population sampled, there are good reasons 
for believing that = has been drawn from some normally distributed population. 
This is not a necessary limitation of the general theory of likelihood, for it is 
possible, in theory at any rate, to compare the point-densities of any types of field 
whatsoever, but we consider here only the simplest case*. Taking X as the mean 
of = referred to a fixed origin and s as before, its standard deviation, the density 
at P of the field associated with some population II’, mean at a’, and standard 
deviation o’, is given by 


‘ n er eee e 
i I e ee I cell (xvii). 
(V2Qa0’)" 
To find the IT’ for which D is a maximum we solve 
oD < : 
-=0 or X =a’, 
oa 
oD n 1 n{(X—a'¥+8%} 
A ~=0 or -—--; + 5 i Dead 
0c o Mt ao” co? 
giving o?=(X —a’P+8%. 


The population of maximum likelihood + is therefore that for which a’ = X 
o’ =s, and the corresponding field gives as the density at the sample point P, 
n 
Big TA 8 pcccccvicmncvanen (xvili). 
From this we obtain the ratio 


Likelihood of TI D (2) : (“3 = 1) 
- e 2\ ¢ 


= aceite = er el 
Likelihood of I’ (max.) Dy, 


* The effect of a moderate divergence from normality is considered experimentally below. 
+ It can be shown on a further differentiation that this is a maximum solution. 
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This ratio remains constant upon a certain contour surface in the density 
space of II. That is to say, the ratio of the likelihood of II to the maximum 
likelihood that can be associated with an alternative normal population is constant 
upon this contour, but steadily diminishes on passing outwards from one contour 
to another of the system. The surfaces are obtained by the rotation about the 
axis (111) of the following curves in the (m, s)-plane: 

(m* + s*)/o? — log, (s*/o*) = 1 — (2/n) loge A =k log. 10 ......... (xx). 

This equation represents a series of oval curves surrounding the point m= 0, 
s=a; the particular contour is determined by the constant k, and as n appears 
only on the right-hand side of the equation it will be seen that the form of the 
system as a whole is independent of the size of the sample. For large values of n 
the curves will differ only slightly from those of the contours of equi-probable 
doublets (m, s) given in (vi). In the fundamental space the surfaces will be closed 
contours which will not contain the origin. For the case n=3 it is possible to 
visualise the system of hollow rings obtained by rotating about DOD’ curves 
much like the (m, s)-contour drawn in Figure 2. 

Without claiming that this method is necessarily the “best” to adopt, we 
suggest that the use of this contour system in association with the density field 
of II provides at any rate one clearly defined method of discriminating between 
samples for which Hypothesis A is more probable and those for which it is less 
probable. It is a method which takes into account the likelihood of alternative 
hypotheses, and while its justification does not rest on the choice of any particular 
statistical constants, the close agreement as n becomes large of its contour system 
with that of the equi-probable doublets (m, s) provides another form of inter- 
pretation. 3 

To employ the method, it is necessary to know 

n (m? +s”) 

Py. =c; [ oe" ¢ MOT SEE. ec eaccak dc endscous -o( ZX1); 
an expression corresponding to (xii), except that here the integral is to be taken 
outside the contour of (xx) which passes through the point (m, s) corresponding in 
the (m, s)-plane to the observed sample, =. If then we only reject Hypothesis A 
when P, is, let us say, <‘01, we can control the error of form (1)*, while the use 
of the contours of likelihood will minimise as far as possible the effect of errors of 
form (2). It would be possible to use the ratio » of (xix) as a criterion, but this 
without a knowledge of P, does not enable us to estimate the extent of the 
form (1) error, 


Tables of P, form =3 to 50 are given in the Appendix. These are entered 
with k, which may either be calculated for the case of a given sample exactly 
from (xx), or found readily with sufficient accuracy for most practical purposes 
from the diagrams of Figures 13 and 14. The method by which the Tables were 


See p. 177. 
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computed and also the manner in which P, can be obtained for higher values of 
n are described in the Appendix. Illustrations of the use of the Tables are given 
in Section (9) below. 


(5) Student’s Test and Hypothesis B. 


This is the problem to which Student's z-test has been applied*. The sample, 
>, of n observations X,, X.,...X, has a mean X and a standard deviation s. 
There is every reason to believe that ~ has been drawn from some normally 
distributed population, and we wish to test Hypothesis B—namely, that = has 
been randomly drawn from such a population with mean a, but unspecified 
standard deviation. 

In the fundamental space the axes and the position of the point P repre- 
senting = are as before, but the density field appropriate to II depends upon the 
unspecified o. Of the contour systems considered above only those of z are 
independent of o, and it might therefore appear at first sight that here was 
sufficient justification for the use of Student’s integral, P,, to test the probability 
of Hypothesis B. But B is really a multiple hypothesis concerning the sub- 
universe of normal populations, M (II), with means at a and with varying standard 
deviations, It only becomes precise upon definition of the manner in which @ is 
distributed within this sub-universe, that is to say, upon defining the @ priori 
probability distribution of o. Although it is true that whatever this may be, the 
distribution of z in sampling from J/(II) will be the same with the probability 
integral given by (xi), it does not follow that the z-contours will therefore be the 
appropriate contours to use in testing the hypothesis as a whole. If it were 
possible to determine an & priori distribution for oa, 

= 


y = $(c), where | PRO PS ion. scarverconeeences (xxi), 
/0 


then we might fill the fundamental space with a density field appropriate to 
M (II) given by 
1 


1 ro — =, S(X,-a)? 
- d(a)je 2 sii 1” 


= On =}, of DB cisct sas ocewan (xxill), 
and consider what in this were the appropriate contours to use. In practice, 
however, it is doubtful if ¢(¢) could ever be determined, and if some arbitrary 
form is to be assigned, it is simpler to approach the problem from the point 
of view of inverse probability, and think of the density space as associated with = 
rather than (II). This will be done in a later section. 

In a problem where Hypothesis B is to be tested, the investigator will usually 
be quite certain that o cannot have all values between 0 and o. He may often 
be able to define broad limits outside which it is impossible that it should lie, and 
to suggest narrower but less clearly defined limits within which it most probably 
lies. But although he might say, “As far as I know, o may lie equally well 
anywhere between 6, and 6,,” he would probably find it difficult on reflection to 


* Biometrika, Vol. vi. p. 1 et seq. 
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justify this statement, rather than, “As far as I know, o*? may lie equally well 
anywhere between b,* and b.2.” In fact, his & priori information as to @ is generally 
not sufficiently precise to bear exact numerical expression. Under such conditions 
the criterion of likelihood will probably be of service. 

Corresponding to any given =, we can find the population IT out of MM (11) for 
which the likelihood is a maximum. This is obtained from (xvi) by equating 
dD/de’ to zero, considering a=a’ as fixed. We find o? =m? +s*, and for this 
population the density at P is 


1 1 “ 
(27r)"? (am? + s*)n? 

We can now obtain the ratio of (a) maximum likelihood for a member of 
M (11), to (6) maximum likelihood for any normal population (the D,, of (xvi)), or 


0 


neesbavaonnts (xxiv). 


n n 


oo 2 2 Pe ae 
% “> (=) CEA By Scacsandeceesavere (xxv). 


This ratio is constant along the contours of z. It follows that if the sample 
point is surrounded by an element of volume, the ratio of the maximum frequency 
with which a sample lying within these limits will be drawn from a normal 
population with mean at @ to the maximum frequency with which it could be 
drawn from any unrestricted normal population whatever, is constant upon one of 
these contours. Speaking therefore in somewhat loose terms, we may say that, 
other things being equal, we shall be more ready to accept Hypothesis B for a 
sample lying inside or on a certain z-contour, than for one lying outside it. And 
we may measure our confidence in a decision to reject or accept the hypothesis by 
P., which is the integral of the density outside the contour whatever be o. 
Student’s test was originally devised to allow for the use of s instead of o in 
testing the significance of a deviation in the sample mean. We have above 
another interpretation of it as a method of testing the sample as a whole, based 
upon the criterion of likelihood, which will be valid as long as that criterion can 
be employed. 

Whilst the @ priort knowledge regarding the value of a remains loosely defined, 
the open z-contours are probably the best that can be used. But if there are 
grounds for believing that o must lie within a limited range, the contours which 
it would be natural to adopt become deflected until, when & can be almost exactly 
located, they will form a closed system tending to the A-contours. Suppose, for 
illustration, that in a sample of 10 there is a mean of 20°4 and a standard deviation 
of 3°6. We may ask whether it is likely that this sampie has been drawn from a 
population with mean of 18°0. As long as there is no & priort information which 


enables the value of a to be closely determined, or which is contradicted & posteriori 
by the sample value of s = 3°6, it seems reasonable to apply Student’s test. But 
if there were grounds for believing that o could not exceed 1°5, the knowledge 
that for z=%, P,='26 would be of little value in solving the problem in hand. 
This point is illustrated more fully in the first example of Section (9) below. 
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(6) Alternative Method of examining Hypothesis B. 
Take as variables M = = and S= =, and suppose Figure 3 (p. 192) to represent 
the (M, S)-plane. The X contours of (xx) become 
M? +. S* — log, S* = blog, 10 ..........s020s00e00 .(xxvi), 
and surround the point C (7 =0,S=1). Since the point-density in the (m, s )-plane 
is given by (v), that in the (M, S)-plane will be 


d’ = constant x S™2e 25 7" 0... poeaeeten ee (xxvil) ; 
consequently this plane with density d’, and the contour system (xxvi), can be 
looked upon as a fixed field of reference whatever be o. In testing Hypothesis B, 
we cannot locate in the plane the point P corresponding to X, but only the radial 
line O7' somewhere on which it mus% lie, where 

M/S= m/s = z= cot TOD. 

P, is the integral of the density throughout the sector TOD of the plane. If 
ao were known, the point P could be identified, falling let us suppose at Q. We 
could then find P,, the integral of d’ over the region lying outside the \-contour 
drawn through Q. P, will clearly be a maximum if P were found to fall at R, the 
point where O7' touches a member of the A-system. This has been seen from 
a different method of approach* to occur when o? = m? + s*, or M?+S8?=1. Con- 
sequently in Figure 3, OR? = 1 = OC*. 

We may now argue as follows. For a given =, the most favourable case giving 
a maximum value to P, arises when we suppose the sampled population from 
M (Il) to be I, with o=Vn?+s. If, on giving this value to o, Hypothesis A 
judged by P, and the criterion of likelihood is improbable, then it will be more 
improbable for every other normal population with mean at a Hence we reject 
Hypothesis B. If on the other hand P, is such that Hypothesis A appears reason- 
able for population II,, then Hypothesis B can hardly be rejected, because while it 
may not be possible to assert that it is most probably true, we know that for 
values of o in the neighbourhood of Vm? +s? the sample & is not exceptional. To 
the criticism that there is no justification for assuming that I], is the population 
sampled, it can be answered that unless some & priori distribution of « be postu- 
lated, there is no justification whatever for assuming that II, or a population 

* On p. 190 above. The result may also be proved directly as follows: 

The contour equation is 

¢ (M, S)=M?+ S?-lo 

and the tangent to this at the point (W,, S,) will be 


S?=k log, 10, 


o 
de 


, 0 ao 
S-—S,= M-M,)s —— 2 
. ; (= 1) on, os, 
tol 0p . 2 
3 2) =~ =28, - 
But aM, 3 aS I~ s? 


so that the tangent becomes 
S—S,=(M-—- M,) M,S,/(1- 81). 
As this is to be the line OT passing through the origin, 


S,=M,2S,/(1- 81°) or Mj2+8;2=1, m?+s?=07. 
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differing slightly from it is improbable. The greater the range of values of o 
above and below Vm? + s*, which give reasonable values to P,, the more probable 
becomes Hypothesis B, but no attempt is made to give this probability a unique 
numerical measure. 

It will be seen from Figure 3 that if Student’s test be applied, 2P, is the 
integral of the field between the axis of M and the tangents O7, OT". Clearly 
2P,< P, (max), and the one will be a single-valued function of the other. It 
follows that if we were to reject Hypothesis B whenever P, (max) <a, we should in 
fact always reject it for samples for which z>y and 2P.< £8, where B<a. If 
then a statistician thoughtlessly decides, whatever be the test, to reject an hypo- 
thesis when P < ‘01, say, and accept it when P >°01, it will make a considerable 
difference to his conclusions whether he uses P; or P, (max). But as the ultimate 


F 


T’ T 


ec 








D’ Oo D 
Fic.3 


value of statistical judgment depends upon a clear understanding of the meaning 
of the statistical tests applied, the difference between the values of the two P’s 
should present no difficulty. The difference in the two scales of probability 
simply corresponds to the difference in attitude of mind with which the problem 
is approached; the use of P, (max) perhaps corresponds to the more cautious 
attitude, but it would seem impossible to claim that one approach is the correct 
one and the other is erroneous. 


(7) Solutions obtained by the Inverse Method. 


The inverse method of approach is to start from ¥ 
the one certain fact in the problem. 
samples but populations from which } may have been drawn; that is to say, the 


density field is made appropriate to % and not to II, and we shall reject an 


x, the observed sample, as 
Points in the space no longer represent 
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hypothesis concerning the population sampled on entering regions of the field in 
which the density falls below a certain level. This alternative point of view is of 
some interest, and although it leads to tests which are almost identical with those 
reached above, we shall discuss it in some detail in the present case of sampling 
from a normal population. 

We must again assume that = has been drawn from some normally distri- 
buted population, and as this can be described completely by its mean, a, and 
standard deviation, ¢, the whole discussion may be confined to the two-dimensioned 
(a, c)-plane. Referred to a fixed origin in this plane and rectangular axes, = is the 
point G (X, s) and any population I] a point P (a,c). The position is illustrated 
in Figure 4. 

The ordinary method of inverse probability consists in postulating some 
function ¢ (a, o) to represent the probability & priori that the sampled population 
is II, and taking for the point-density of the field 


8 = ie 
— gg 1X a)* + s*} 


D = constant x g(a, 0) x oe BPO kee eee (xxviil). 
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D.da.dea would then be termed the probability 4 posteriori that the popu- 
lation sampled has a mean and standard deviation in the range, a + 3da, o + }do. 
The difficulty of this procedure in any practical problem lies in the fact that it is 
almost impossible to express ¢ in exact terms. We prefer therefore to follow 
a line of argument which while really equivalent to the above with ¢ assumed 
constant makes use of the principle of likelihood rather than the somewhat vaguer 
conception of @ posteriori probability. 

The likelihood of II may be written 


nN os eal 
f - X — a)? +8"} . 
L« D=(1/N2r)" o-"e liad MOT Le (xxix). 
Population points such as P of Figure 4, for which Z is constant will lie on 
closed curves, - 
log. o? + [(X —af+s*}/e=h/n .......... vavensauadas 


Biometrika xxA 13 











194 On the Use and Interpretation of Certain Test Criteria 


which surround the sample point G. Give to the field a point-density D. If we 
are right in believing that the value of the likelihood is a useful criterion, 
then ZL (or D) itself would provide us with a measure of the confidence to be 
placed in an hypothesis as to Il. This quantity as it stands is not however 
measured on a very suitable scale, for it is difficult to know how to interpret the 
very smal] values that LZ will generally take. Let Z be the integral of D taken 
throughout the whole (a, )-field*, and J’ throughout that portion of it which lies 
outside the member of system (xxx) passing through P. Then the ratio I’/I 
appears to be a suitable measure to use. It is the same for all populations of a 
given likelihood and it varies between 0 and 1; when it is very small we know 
that the likelihood of II stands very low in the scale, and when large that L 
approaches the maximum value. Likelihood as defined by Fisher is a quantity 
which cannot be integrated+. With this in the strict sense we agree, but look 
upon the relation J’/J =f(Z) as a form of transformation providing as a criterion 
a function of Z placed on a scale more easily understood in common terms. 
We can now write 
r n (xX 24 92) 
P, = I'/T=6, | ao"e 20° cnitiaslei ON Seema (xxx1), 
Jw 
where the integral is taken outside the member of (xxx) passing through (a, o), 
and ¢, is a quantity depending only on n and s, which is chosen so that Py~=1 
when a= X, c=s. 
It is of interest to compare the values of 
Pm,s, (xii); Pa, (xxi); and Py,,, (xxxi). 
Each is a function of » and the four quantities, X,, 8, @ and o,, which define the 
particular = and II with which Hypothesis A is concerned. Further, they are each 
integrals taken in a two-dimensioned density field over regions outside an oval 
contour. They may be brought into comparable form by means of the following 
transformations. 


(a) } ae 


Transformation miao,=u, s/o,=v, dmds=a7dudv. 
Take ky = n {log (s:1/¢,%) —[(X,— a) + s/o. 
The density in the (u, v)-plane becomes, on transforming (v), 
* (u? + v%) 
—=(u?+v" 
POE I aekaccciusincanancucess (xxx1i). 
P»,s 18 obtained by integrating D’ outside the contour 
(n — 2) log v? — n (u? + v?) =k, — 2 log (8.2/0.7) .........(xxxii bis), 


* It is not necessary to suppose this to range from a= +@ to —», s=0 to 


+o. The conclusions 
will not be modified if we suppose that the field is bounded by one of the contours of (xxx) which 
contains, let us say, ‘999 of the density field. 

+t See Phil. Trans. A, Vol. 222, p. 327. 


“ec, 


eet : 2 i“<s —s 
The integral I does not equal unity, but (" ; )/ [Fir * a? s**t}. 
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(b) 


The transformation is as for P,,,,, and the density in the (u, v)-plane is as 
in (xxxil). The integral is to be taken outside the contour 


8 log of — 2 (a? + 0) mB, ......0.0cccscceeseceees (xxxiil). 


This corresponds to (xxvi). 


© Pas 

The transformation is (X,—a)/o =u, s,/o=2, 
whence a= X,—su/v, c=8,/v, 
and dado = J .dudv= s;/v* dudv. 


Then the density becomes, using (xxix), 


Bo 45% a 
—3 (u* + v*) 























Dp ayer F leer (xxxiv). 
u 
’ 
ite . 
rex | , 
EB 
—U +U 
MX5-a—+0 Oo” MSM 
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And the contour, using (xxx), is 
log (s,2/v*) + wv + v = log a? + (X, — &)!/o2+ 82/o8, 
or n log v* — 20 (0? + ¥*) = by... .ccceececeeees (xxxiii bis). 
The nature of this transformation is shown in Figure 5. The sample point G 
transforms into G’, « = 0, v= 1, and two points P,, P, on a contour become Py, Pr. 


where 
O'M/ =u,=tanGOP,, P/M =v,= 0G/P.M, 


O'Ms =%=tanGOP,, P/M =v,= 0G/PM. 


Il 


The field outside the contour in the (a, c)-plane corresponds to that outside the 
contour in the (u, v)-plane and it is over this that D” is to be integrated. 


13— 
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On comparing these results in the (u, v)-plane we see that 

(1) the density fields for P»,, and P, are the same, namely that given by 
(xxxii), but the contours (xxxii bis) and (xxxiii) differ both in the factor (n —2) 
for n, and in the additional constant term — 2 log (s,*/a,*) contained in the former ; 


9 


(2) the density fields for P, and P,,, differ in the term v"~* of (xxxii) and 
v"-8 of (xxxiv), but the contours are identical (xxxiii). 


In large samples, for a given = and II, the three tests become practically 
equivalent since the values of the three P’s will scarcely differ. They each result 
however from different methods of approaching the problem, and in small samples 
will lead to somewhat different results. It is the quantity P, which has been 
tabled. 


We shall now consider whether it is possible to obtain from the (a, o)-field, 
measures of the probability that the population, from which = has been drawn, 
has 


(1) a mean a, such that a, < a < ay, irrespective of the standard deviation ; 


(2) a standard deviation o, such that o, < o < a2, irrespective of the value of a. 


Again, as likelihood can only be associated with a single population with 
unique values of a and o, we cannot speak of the likelihood of populations for 
which (a, o) falls within the regions of the field defined above, although we might 
speak of the probability & posteriori of such an event, assuming ¢(a, 7) to be 
constant. Whatever be its exact interpretation, however, the integral of D in (xxix) 
taken over these regions and divided by J (the integral over the whole field) will 
provide us with some form of measure of the probability of the hypotheses we 
are considering, and will be a quantity varying’ between 0 and 1. 

First integrate D, therefore, along a strip of breadth da at a distance a from 
the axis of a: 


p © n 


da D .do =constant x da | o-"e 2? 
0 


0 


((X - a)? +-8?} 


do. 


Write w=n {(X —a)?+s*}| and u*/2o7=v, so that —dou?/c*=dv and the 
integral becomes 
© n-3 


constant x u~"+! da | ev 2 dy 
“0 


Il 


= constant x u-"*' da, 
the constant being a function of n only. 
Dividing by J, and writing z=(X—a)/s, we have finally for the measure 
of probability that the mean of IT is such that a, <a< ay, 


zs 2! 
P(a,a,)=¢,|  Berrerererere | | 


where z,=(X —a,)/s, 2,=(X —a,)/s. 
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} Except that we have now n — 1 instead of n this expression corresponds to (xi), 
and its value may be obtained from the tables of Student's z or ¢ function, using 
n—1 instead of n*. 


Now integrate D along a strip of breadth do at a distance o from the 
axis of a: 
ni ns” 
} do | D.da=constant xo" e 2% de. 


@ 





On dividing by J we have for the measure of probability that the standard 
deviation of II is such that o,<o <6, 


ow, - == . 
P (o,, 0.) = ca| ate Wde ....... J cedatwe teewens 


a 


. 2 . j ; . 
By making the transformation Vns/o= y, this becomes 


di _ 
P (Hi, eo) = l Pree ay] we“ dy ...(xxxvi bis), 


which is the difference of two ordinary y* integrals and can be obtained from the 
x’ Tables by taking yw? = y*, n’ =n — 1. 


If in equation (ix), where the variable is s, we write wv =Vns/o, we obtain a 
probability integral P, of exactly the same type as (xxxvi), except that there 
will be a term W"~ instead of w"-*, that is to say, the y* Tables must be entered 
with n’ =n. 

We have reached, then, three pairs of corresponding results, (xxi) and (xxxi), 
(xi) and (xxxv), (ix) and (xxxvi); in each case there is this difference of a unit 
in n which appears to result invariably on proceeding by the inverse method. 
Probably most statisticians will prefer to use the criteria (xxi), (xi) and (ix) 
of the direct method, which with the addition of the present P, Tables can 
be readily expressed in numerical terms. The difference of the two is however of 
no great importance except in very small samples, where final conclusions will be 
drawn in any case with some hesitation. 

(8) Analysis of Church’s Samples from a Skew Population. 

Dr A. E. R. Church very kindly placed at our disposal one of his sets of 
sampling results previously obtained for another purpose+. These were the 1000 
samples of 10 drawn from a smooth distribution with the following frequency 
constants : 

o =3°569,207, ,='219,333, ,=3°157,676. 

This distribution not differing very greatly from a Type III form may be 
taken as typical of the moderately skew distributions frequently met with in 

* Student’s original table of z=m/s for n=2 to 10 is given in Tables for Statisticians and Bio- 
metricians, p. 36 (second edition). A complete table for n=2 to 21 with auxiliary tables for use with 
higher values has been published in Metron, Vol. v. p. 3, 1925. These latter tables must be entered with 


t=zNn’—1, where n’ is now taken as the size of sample. 
+ Biometrika, Vol. xvi. pp. 321—324. This is Church’s Population “ B” and group of samples “IV.” 
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“normal” formulae. 





It is therefore of considerable importance to discover how far the ) 
variation in the frequency constants can be represented by the appro- 
In practice we may often not have available sufficient 


information to test whether the population II be normally distributed or no, and 
even if this be available may be unable to deduce the appropriate formulae for 


TABLE I. 


Distribution of Frequency Constants for Church’s 1000 Samples of 10. 
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samples from skew distributions. To what extent, then, shall we be likely to draw 
faulty conclusions if we apply the normal criteria in these cases ? 


Church has fitted a normal curve with standard deviation of o/Vn to the 
distribution of means, and finds on applying the test for goodness of fit, a value 
for P of ‘5758 *. By taking the square roots of his values of s?, we have obtained 
the distribution of s given in Table I. This is compared in the next column 
with the theoretical distribution appropriate to samples from a normal distri- 
bution with the same standard deviation, viz. ¢ = 3°569,207. The distribution of 
s? for a normal population is of course a Type III curve, and the theoretical 
frequencies for s were found by taking the areas between the corresponding 
ordinates of the s* curve from the Tables of the Incomplete Gamma Function. 
Testing for goodness of fit, y* = 26°12 and with n’=19, P=-097. The larger of 
the differences in frequency are rather irregularly scattered, but there appear on 
the whole to be too few low and too many high values of s among the obser- 
vations. This means that the observational distribution is more skew than that 
of normal theory, a point brought out from a comparison of the frequency con- 
stants given in the same table. The constants for normal theory are taken from 
the table given in Biometrika, Vol. x. p. 529. The standard error for o, was 
obtained on the assumption that the theoretical distribution of s was normal +. 
The ratios of the differences between the observed and theoretical values of the 
four frequency constants to their standard errors are + 1°0, + 1-2, + 1°69 and — 0:2 
respectively; the disagreement except perhaps in the case of V8, cannot be 
considered very serious, although we are probably just reaching a degree of 
skewness in the population for which the normal theory begins to fail. 

Compare now the observed distribution and that of normal theory for z = m/s. 
They are given in Table I; here y*= 29°19, n’=18 and P=-033. The most 
serious discrepancy, contributing 9°98 to y°, occurs in the group for z=—°6 to —°7, 
and there is no doubt that among the observations there are too few with large 
negative and too many with large positive values of z The observed 2-distri- 
bution is in fact significantly skew, whereas Student’s distribution is symmetrical. 
A comparison of the frequency constants is given in the same table. Judged by 
the standard errors, it is only V8, that differs significantly from normal theory}. 

* Biometrika, Vol. xvi. p. 334. 

+ For low values of 8, the table (Tables for Statistictans and Biometricians, Table XXXVII) giving 
the standard error of this quantity cannot be used, as in this region the second order term is as great 
or greater than the first order term which vanishes when §,=0. It is the first order term which is 
tabled. We have therefore compared the values of VB, for which the standard error to first order is 
equal to 

{8,384 — 68, + 3818+ 226, + 9}3/VN 
(where N is number of samples), and have inserted values for 8, and f; obtained from the theoretical 
8, and fy, on the assumption that the distribution can be represented by a Pearson curve. 

{ The standard error of JB, was calculated from the formula given in the preceding footnote ; 

this gives to the first order for samples from symmetrical distributions, 
(S.E. VB,)2=(81 — 682+ 9)/N. 
In the case of Student’s distribution for n=10, By=49°0. 
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Church’s population distribution was negatively skew, and this results in positive 
skewness in the distribution of z. 

In practice the method of applying the test consists, when the observed value 
of z is large, in rejecting Hypothesis B with a degree of confidence which increases 
as P, decreases. Taking P, equal (to the nearest figure) to (a) 05, and (6) °005, 
we find from the observations, 


(a) we positive values of z: 57 cases out of 1000 instead of 53, 
a 


» hegative _,, a “ =e 7 53, 

(b) f “0 positive ” »” : 6 ” ” ” 0, 
) . oo 

" (5 negative ,, a ey 7 7 w ) 


This difference must not be passed over, but it is doubtful whether the know- 
ledge that P, was really 03 (or ‘06) rather than °05, or again -002 (or -006) 
rather than ‘005, would in fact ever modify our judgment when balancing the 
probabilities regarding the origin of a single sample. 

Finally we may consider the distribution of the observations among the P, 
contours. Figure 6 is a reproduction of a rough working diagram containing 
entries for the 1000 doublets (m, s). The scale attached to the margins is that of 
the ratios m/s and s/o, where o is as before the standard deviation of the popu- 
lation. Eight of the P, contours are drawn in the figure; the observed regression 
straight line of s on m has also been given. It is obvious from inspection that 
the condition which holds for samples from a normal distribution of no correlation 
between m and s and of homoscedasticity is now far from true. The coefticient 
of correlation calculated from the observations is 7 = + 3206 with standard error 
‘0191. The theoretical correlation between m and s*, which gives a fair approxi- 
mation to that between m and s, is *3036*. The correlation ratio of s on m is 
n = "3495. In spite of this obvious disagreement with the distribution of the 
normal (m, s)-field, it is found that the frequencies of sample points lying outside 
the successive P, contours follow very closely the expected numbers. A com- 
parison is shown at the end of Table I; the only disagreement of importance is 
that for the contour, k=°55, outside which 321 observations instead of 299 lie. 
The y*-test may be applied to the frequencies between contours, for which there 
will be 9 groups including that for observations inside k = °45 and for observations 
outside k="90; here y*= 7:36 and P =-50. 

The P, contours can no longer of course provide a completely satisfactory 
criterion, since they have ceased to divide on a logical basis the sample points for 
which Hypothesis A is more probable from those for which it is less probable. 
They appear however to enable us to control the first of the two sources of error 
described in the introductory section+, since they do enclose approximately the 
expected proportions of random samples drawn from TI. Whether the situation 
would be as satisfactory for samples of a different size, or for different population 
distributions, we cannot of course say without further experiment. 


* Calculated from the formula (68) given in Biometrika, Vol. xv1t. p. 479. 
| See p. 177 above. 
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(9) Illustrations of the use of the Tables of P). 

The tables are given in the Appendix with a description of the method by 
which they were computed and of how they may be extended. They must be 
entered with n, the size of sample, and k; in the following examples k has been 
obtained from one or other of the diagrams, Figures 13 and 14, pp. 236, 237, in 
which the marginal scales are (1) M, the ratio of the deviation in the sample mean 
from the population mean to the population standard deviation o, and (2) S, the 
ratio of the sample standard deviation, s, to o. 

Haample 1. 

Measurements on 884 Egyptian skulls (XX VI—XXX Dynasties)* gave for the 
distribution of cephalic index, Mean = 75-06, Standard Deviation = 2°68. Would 
it be justifiable to consider that the 10 skulls with cephalic indices as follows, 
741, 77-7, 744, 740, 73°8, 72°2, 75°2, 78°2, 77-1, 78:4, were a random sample from 
this population ? 

(a) The distribution of cephalic indices is in general found to be symmetrical 
and leptokurtic ; we shall assume, pending the completion of further inquiries, that if 
8, = 3°4 to 35, the P, contours are still adequate. It is found that the mean 
cephalic index is 75°51 and s = 2°059, from which M = + ‘168 and S=-768. Using 
Figure 13 (the sign of M is immaterial) it is seen that the sample point lies just 
inside the contour / = °50; entering the P, Table of p (see p. 238) with n= 10 and 
k=°'50 it is found that P,=°'504. It is therefore possible to say that judged 
by cephalic index alone there would be no reason to doubt that these 10 skulls were 
a random sample of XX VI—XXX Dynasty material t. 

(b) Now apply the same test to the following 10 cephalic indices: 66:7, 
69:4, 67°8, 73:2, 79°3, 80°7, 649, 82-2, 72-4, 78:1. Here X = 73-47, s=5°942, 
M =—°593, S=2:217; the sample point falls near the contour k=1°60 in 
Figure 14, and for n=10 this corresponds to P, less than ‘0001. From the 
position of the point in the diagram it will be seen that the divergence is mainly 
due to a very large value of s compared with o; the mean is not exceptionally 
divergent from the supposed population mean of 75°06. We should conclude that 
it was very improbable that the 10 skulls were a random sample from the Egyptian 
population, and that probably they consisted of heterogeneous material, the 
variability being greater than that generally found in a sample from a homo- 
geneous race. 

(c) Suppose that we had been given only the mean cephalic index and not 
the standard deviation of the population. We could ask whether it was likely 
having regard to the observed variability of s = 5-942, that a random sample of 10 
could have been drawn having a mean differing by 1°59 from the population 
mean. Applying Student’s testt it is found that z=— ‘268 or t=z Vn —1=—*803 

* Biometrika, Vol. xv1. Tables II and III, pp. 337, 338. 


+ Actually the cephalic indices are those of the first 10 of the Farringdon Street skulls measured by 


Miss Hooke. Biometrika, Vol. xv. p. 54, Appendix A. 


t Using either of the Tables referred to in the footnote to p. 197 above. 
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and P,=P,=°'221. That is to say so large a deviation of z in excess or defect 
from zero would occur in 44 per cent. of random samples. Consequently the 
evidence does not contradict Hypothesis B*. 


(d) If however we had known from other sources that the standard deviation 
of cephalic index within a single race lay within the range 2°5 to 40, we could 
have improved upon the z-testt. Although o be unknown, we could say that the 
sample point in Figure 14 must lie somewhere on the line z = +268 which joins 
the origin, M = 0 =S, to the corresponding division of the radiating 2-scale given 
in the right-hand margin of the diagram. The exact position will depend on the 
value given to o, but if we might infer that this is unlikely to be greater than 
4-0, we could say that it was very unlikely that the sample point lay farther to 
the left along the z-line than the point for which S= s/o = 5942/4 = 1-485. To the 
right of this point the contours cutting the line have k > “68 and therefore P, < ‘08. 
We should therefore argue that it was far less likely that the hypothesis as to 
mean cephalic index of population were true than the value of P, obtained in (ce) 
would suggest. 


(e) If there had been no @ priori information as to o, it would still have 
been possible to locate the sample point on the line z=:268. By taking 
o =Vm? + 8° = 6:13, or M?+ S?=1, we should locate the sample at that point on 
the line where it touches a contour of the system. This will roughly be that with 
k =-47, giving P,= "689. Consequently were it quite possible for a homogeneous 
race to have a standard deviation of cephalic index as great as 6:0, we should 
have no reason for doubting the origin of the second sample of 10. This is an 
illustration of the line of reasoning followed in Section (6) above. 

Example 2. 

Records of weight in a large population of mice for males between 120 and 
140 days of age show a mean of 23°823 grammes, o = 3137 grammes, and for the 
frequency distribution 8, =*086, 8, = 2°687. 

Can the groups of 6, 4 and 5 mice with the following weights be considered 
random samples from this population ? 

(a) 22°5, 26°0, 20°5, 24-0, 18°0, 245. 
(b) 21-0, 21:0, 20°0, 20°5. 
(c) 22°5, 23:5, 23°5, 25°0, 24:5. 


(a) n=6, M=—°395, S='851. Using Figure 13, we see that k =*52 approxi- 
mately, giving P,=°607, so that there is no reason to doubt the origin of the 
sample. 


* If the inverse argument discussed in Section (7) above is used, Student’s Tables would be entered 
with n=9, and this gives P,=*223 against ‘221, a difference of no importance as far as interpretation 
is concerned. 

t The following values for the standard deviation of cephalic index will be found in recent papers 
in this Journal: XXVI—XXX Dynasty Egyptian 2°68; Naqada 2-80; 17th Century English 3-26; 
Farringdon Street 3°48; Australian ‘*A” 3°88, 
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(b) n=4, M=— 1-020, S=-132. The point is beyond the outermost contour 
of the diagram but this suggests a value for k of 21. Using the exact equation for 
k, or 

k = "434,294 (M? + 8?) — 2 log,, S, 
it is found that k = 2218 and P,=°002. This small value of P, is chiefly due to 
the low value of S. 
(c) n=5, M=—-007, S='278. Here k=1:15, P,=-036. Again the value 


of S is unusually small. 


The values of P, in (b) and (c) are small enough to make us search for an 
alternative hypothesis to that of pure chance fluctuation. As a matter of fact 
while (a) was a randomly chosen sample, in both the cases (b) and (c) the mice 
belonged to a single litter, so that the individual weights were correlated partly 
from the effect of inheritance, and partly because weight is correlated with size of 
litter. 


Example 3. 

The tables may be used in cases where the normal curve gives an approxi- 
mation to the binomial. 

A count of over a thousand words taken at random from daily newspapers gave 
a proportion of ‘6691 monosyllables. Out of 35 words chosen at random from the 
works of 12 English authors, the following numbers of monosyllables were found*: 
Kipling 26, Thackeray 26, rags 24, Meredith 24, Carlyle 23, Scott 23, 
Stevenson 22, Conrad 22, Borrow 22, Ruskin 21, Bryce 19, Hazlitt 12. Do these 
representatives of literature either appear to use a lower proportion of mono- 
syllables or show among themselves a greater diversity in proportion than would 
be expected to occur in random samples from a vocabulary in which the pro- 
portion of monosyllables was ‘6691 ? 

The appropriate binomial has n= 12, p=°6691, ¢ = ‘3309, and therefore a 
mean of 23°42 and standard deviation of 2°784 monosyllables. The §8,=-015 and 
B,= 


2-957, so that the fit by a normal curve should be satisfactory. The 12 samples 
ok 35 
=]: 


eis a mean of 22:00 and standard deviation of 3°559, giving M=—-510, 
278, and k="60, P, ='121. The P, would be very much larger but for the 
wot from Hazlitt+, and there appears in this particular character to be no 
significant difference between the language of the 12 writers and that of the 
newspapers. 


Example 4. 


Another type of problem may be illustrated as follows. A number of laboratories 
are perhaps determining by chemical analysis the percentage of some constituent 
substance in a given material. Take as an example the hypothetical results 

* The 35 words were the first on the last line of each of 35 consecutive pages. 

+ A further random sample of 100 words taken from Hazlitt’s Characters of Shakespear’s Plays 
contained 42 monosyllables, corresponding to 14°7 out of 35, and suggesting that there may really be 
some significant difference in Hazlitt’s vocabulary! 
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given in the following table; for each laboratory there is a mean per cent. m, 
based on nm analyses; the weighted mean of the whole is 8353 per cent. and the 
combined standard deviation of the 187 analyses 329 per cent. Information is 


TABLE II. 























Laboratory 1 2 3 4 & 6 7 8 9 10 11 
Mean per cent., m; | 8°432 | 8°127 | 8-483 | 8°002 | 8°317 | 8°582 | 8-239 | 8°531 | 8-163 | 8-405 | 8-391 | 7-976 
No. of analyses, 2, | 22 10 | 19 15 16 25 9 16 4 | 16 23 
(m,—8°4) Vino + °46 Medex ‘wees ¥ 4°69 | — 1°01 | 4+-2°77 | — 1°47 +159 |—- 1:44] +-06] —-13]— 4-47 
| | 
| | 











required as to whether, (a) the results are consistent with a quoted trade percentage 
of 84 for the constituent substance, (b) the variation in means is no more than 
would be expected to arise from chance fluctuations, or (c) that variation points 
to systematic differences in the method of analysis or in the material distributed 
among the different laboratories. If the variation among the 187 observations is 
simply due to chance errors, then « = ‘329 may be taken as a fairly close measure 
of the standard deviation of the universe of possible variations in analysis which 
has been sampled; we may therefore test the hypothesis that the sampled popu- 
lation mean is 84 and that the grouping of the observations into 12 sets is of a 
purely random nature, in the following manner. The standard error of each 
laboratory mean will be o/Vm, and, if the hypothesis be true, the 12 quantities 
(m, — 84) Vn,/o given in the last row of the table should each be a chance 
fluctuation from zero reduced to a standard error of unity. On the assumption, 
which could be verified in a given case, that the distribution of individual analyses 
is not exceptionally anormal, the distribution of these deviations in mean may be 
taken as normal. Consequently if we calculate the mean and standard deviation 
of the figures in the last row of the table, we can test the hypothesis by taking 
these as M and S, and finding & and P, with n = 12. 

It is found that M=—-821, S=2°192, k=1°'70, and P, is less than ‘0001. 
The hypothesis is therefore extremely improbable and from the position of the 
point on Figure 14 it is clear that this is mainly due to the high value of S. 
Comparing the difference between the weighted mean of the 187 observations 
(8353), and 8-4 with its standard error of o/V187=-0240, we find a ratio of 
— 1°94, and a deviation as great as this in excess or defect might be expected to 
occur in 5 per cent. of random samples. The variation among the means them- 
selves is however quite inconsistent with the hypothesis of chance fluctuations 
from a general mean, and must probably be attributed to variation in technique 
or to lack of homogeneity in the material distributed for analysis. 

Example 5. 

It is of some interest to compare different methods of approaching the problem 
of determining whether two samples can have been drawn from the same popu- 
lation. Suppose that the following figures were the results of an experiment 
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in Industrial Psychology. A piece of work has been carried out by one group of 
30 workmen according to Method I, and by another group of 40 according to 
Method II, the groups being selected at random from a factory. Are the recorded 
distributions in times such as to justify the conclusion that Method I is the 
speedier of the two ? 


TABLE III. 























Time in seconds 50 | 51 52 | 53 54 | 55 | 56 | 571 58 | 59 | 6O | Totals 
Nae ——t— a os an 
Frequency Method I | 1 3 | 5 faa 5 | 3 l 1 | — 30 
Distributions | Method II | — l 2 5 8 9 6 3 3 l 2 10 
} 








It is found on calculation that, 
for I, mean, Z%, = 53°700 secs.; standard deviation, s, = 1°882 secs. 
fori. .. G=55 175 sees: a Es 8, = 2°072 secs. 


The usual method of procedure is to compare the difference in means with 
the standard error of that difference, and to refer the ratio of the two to the 
appropriate probability tables. The value given to the standard error and the 
tables used will depend entirely on the line of reasoning employed in the final 
process of inference. 


(a) We may compare %,—%, with Vs2/n, + 82/2, and from the ratio, — 3°107, 
obtain $(1+a)=*9991 from Sheppard’s Tables, and from this conclude that if 
the samples came from two populations with the same mean but standard 
deviations of s, and s,,so great a difference in mean, positive or negative, would 
only occur in about two samples in 1000. 


(b) Having regard to the mean values of s, and s, in sampling we may attempt 

to obtain estimates of o, and o, which in the long run are likely to be closer than 
/ ° ° 

s, and s,, namely : s, Vn,/(n,—1) and s, V nol| n.— 1), substitute these in the standard 

error, and hence obtain a value of the ratio = — 3060 and a resulting $ (1+ a) ="9989. 


(c) We may give to the result, (6), the inverse interpretation suggested by 
Rhodes*, 


(d) We may approach the problem by making use of the principle of likelihood 
and reaching the test given by Fishert. Suppose that we have reason to believe 
that two samples have been drawn from normal populations with the same 
standard deviation o, but that it is necessary to compare the relative probability 
of two hypotheses, (1) that the samples come from identical populations with 
mean at a, and (2) that while identical as to variability there are two different 
means @, and ds. 


* Journal of Royal Statistical Society, 1926, Vol. uxxxtx, p. 544. 


f Metron, 1925, Vol. v. p. 7. Statistical Methods for Research Workers, 1925, p. 109. 
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Then it is easy to show that the maximum likelihood of hypothesis (1) is 
associated with a population for which 


M42,+ NF , 1 7 a N,N __ An 
=——— > ao= Sn + NoSo + —— (2, —Za)*t » 
N, + Ns N, + nz | m+n 
Ny +N 
ee ee 
giving LT (max) « e : 


omtn, 
The maximum likelihood of hypothesis (2) is associated with a population for 
which 


a - lo N48 + NeSx 
@,=%,; @,=%,; o* = —_—— 


2 > 


Ny + Ny 
1 Ny + Ny 
~~ 
and [’ (max) « —=—e 7 
ao” + Ny 


As a measure of comparison of the two hypotheses we may take the ratio of 
these two maximum likelihoods, or 
M+ Ny 
L (max)/Z’ (max) =(0'/os"t*™=(14+2) 2 


where 


~ 
< 





The smaller be z, the less likely becomes the hypothesis of a single population 
compared to the alternative hypothesis of two populations differing in mean, and 
consequently we can take z or a function of z as a criterion to use in judging the 
probability that the two samples have come from the same population. It is 
most convenient to refer z to the sampling distribution that it would follow were 
hypothesis (1) true. This Fisher has shown to be 

_% +N - 1 
y=y (1 +2) * 4 


so that in using the test we have only to enter Student’s Tables with 


t=Vn,+n,—2.2 
and n’ =n, +n, —1*. 

In the present example z = — 3662, t = —3°020 and P,= P, =-9982. 

This extension by Fisher of Student’s test would seem to be of considerable 
value in dealing with small samples. Its limitation lies partly in the assumption 
of normality, but as the z-distribution is not very sensitive to changes in the 
population 8’s this will often not be serious; and partly in the fact that it does 
not take into account a possible difference in population standard deviations, 
It assumes without question that these are the same and that there is only 
uncertainty as to the means. In many problems however there will be & priori 
reasons for believing that there can be no significant difference in variability and 
it is in such cases that the test is applicable. 


* See footnote to p. 197 above as to notation. 
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(e) Finally it is possible to make use of the P, Tables in the following manner. 
Having observed the two samples, the single normal population of maximum 
likelihood is that associated with hypothesis (1) of the preceding paragraph, (d), 
namely in the present case a population with a=54543 secs., o = 2°123 secs. 
It may now be asked whether the two series of observations are likely to have 
arisen independently in random sampling if this had been the population sampled. 
It is found that 

for I, n=30, M=—°397, S='887, k='51, P, =°079, 
for II,n=40, M=-—-‘298, S=‘976, k=-47, P, =:'201. 


That is to say in the most favourable light possible, the combined probability of 
occurrence of the two samples may be measured by the product ‘079 x ‘201 = ‘016. 
For any other values assigned to a and o the probability of the observed result 
measured in this manner will be less, and therefore we should feel considerable 
confidence in arguing that the hypothesis of a common origin for the samples is im- 
probable. This method does take into account possible differences in variability, and 
used with discretion would seem to be of value. It would of course lead to some 
difficulty of interpretation should the resulting product of probabilities be, let us 
say, ‘05 to ‘20; if in the most favourable circumstances the hypothesis of a 
common origin is just possible, are we to discard it on the grounds that we have 
no right to postulate the most favourable conditions? A satisfactory answer to 
this question is probably impossible. At any rate the method is perhaps one 
which encourages an attitude of caution. 

In the present example, however, whichever test be used, it is clear that we 
should conclude that process I does almosé certainly lead to a really quicker timing 
than process IT. 


IlI. SAMPLING FROM A RECTANGULAR POPULATION. 

(10) Sampling Distributions of the Frequency Constants. 

The population, II, may be defined by two quantities, the range w, and the 
distance of the centre of this range from a fixed origin, g. Then all values of the 
variable between g— }w and g+4w are of equal frequency of occurrence in II. 
In a particular sample, =, consisting of x observations X,, X,,... X,, the range 
may be denoted by W and the distance of the mid-point or “centre” of this range 
from the fixed origin by G, that is to say the lowest and highest variates of the 
sample are G—}$W and G+3W respectively. The fundamental space for samples 
of x drawn randomly from II will be an n-dimensioned hypercube each of the 
sides of which is of length w, and having as a diagonal the line joining the points 
(g— 3u,...g—4w) and (9 + }w,...g+4w). As in the “normal” space, this 
diagonal forms the axis of the n! similar regions of which one is that defined by 
the condition , 


n> 


REC Sc ER Vkiancciicane cee (ii bis), 


where a, = X,—g + 4w, the origin for the z’s being taken at one corner of the 
cube, 
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The density field within the hypercube will be uniform, for the chance of 
drawing a sample with observations in the limits X, + 4$dX,,...X,+4dX,, is 
dX,,...dX,/w". Making use of this density field, Hall* has shown that the 
distribution of the means of samples of n is given by a curve consisting of n 
connected arcs of degree n—1, having (n — 1)-point contact at their joins. The 
distribution of standard deviations seems likely also to consist of a number of 
connected arcs. 


In testing Hypothesis A we cannot make use of contours of equal density 
as these do not exist. Nor, except from the established custom of regarding 
them as the-most important frequency constants, is there any reason why the 
mean and standard deviation of the sample should be chosen as the most suitable 
estimates of the population parameters. Consider the problem from the point of 
view of the criterion of likelihood. Suppose that there are reasons for believing 
that = has been drawn from some population following a rectangular distribution, 
and that an alternative to II is a population Il’ with centre at g’ and range of w’. 
Then 

Likelihood of I1/Likelihood of Tl’ = w’"/w*. 


For a given &, the population of maximum likelihood is that for which w’ is a 


minimum. But w’ cannot be less than the sample range, W, and if it has this 
value clearly g =G. It follows that in the fundamental hypercube, the contours 
upon which 

= Likelihood of II/Likelihood of IT’ (max) 


is constant are the contours of constant range. 


If therefore we make use of the criterion of likelihood in forming a judgment 
on Hypothesis A, we need consider only the range of the sample, and our confidence 
in the hypothesis will increase as Ww. If W>w or any of the observations 
in the sample fall outside the range of II, then of course the hypothesis becomes 
impossible. Remembering that we are only considering cases where the sample 
has come from some rectangular population, the use of range alone as a criterion 
is seen to be perfectly reasonable. This limitation in the form of the sampled 
population appears more serious than that involved in the previous part of this 
paper, simply because populations of normal or approximately normal form are 
found in experience to be common, so that we can frequently assume normality 
without great risk of error, while it is more difficult to conceive of problems in 
which a rectangular form of population can be assumed @ priort. There is little 
doubt however that useful conclusions would be reached by applying the tests 
developed below to samples from any nearly symmetrical population with sharply 
limited range—that is to say,a population for which f, and £, lay in the neighbour- 
hood of 0 and 1°8. j 


* Biometrika, Vol. x1x. pp. 240—244. 
+ There it was assumed that = came from some normally distributed or nearly normally distributed 
population. 


Biometrika xxA 14 
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Je will now consider the distribution of Wand G@ in sampling from II. In 
W g 

Figure 7, OJ is of length w, and A and B represent the lowest and highest obser- 
vations of the sample. 





® 
f 


2 Se > 





re) A B 
} | 





Fic. 2 
The chance of obtaining one observation in an element dx centred at A, 
another in an element dz, centred at B, and n — 2 between these two values is 


dz In—2,\" da Ln — 2,\""? 
ee = : y= da, dx. 
w w w w" 


G=4(a,+a%,)+9—4u, 


W =n —- %, 





Change the variables to 1 


so that dG dW =dz,dz,. 
Then the chance of obtaining a sample with centre and range in the limits 
G+4dG, W+i4dWis 
$,(G, W)dGdW =kW"/w" .dGd W.............065 (xxXxvil). 
Choosing & so that the integral of @ taken throughout the total range of 
variation is unity, it is found that k=n(n—1). Hence 
$,(G, W) =n (n—1) W/W" «00... cece eee (xxxvii bis). 
Integrating out for G between the limits g —4(w— W) and g+3(w— W) we 
obtain the chance, ¢,(W)dW, of drawing a sample with range in the limits 
Wi dW or 


leslie W 


o.(W)=n(n—1) = 1 -- =) sia actaeeeewen (Xxxvlil). 


This is a Type I curve (and also part of an (nx — 1)th order parabola) with the 
mean at (n—1)w/(n +1), and a range from W=0 to w*. For the reasons given 
above we therefore take as our criterion the probability integral 


W 
Py -| $.(W)dW = {nw —(n —1)W} W""/w” .........(Xxxix), 
0 
The chance of drawing a sample with its centre in the limits G+4d@ is 
obtained by integrating (xxxvii) between the limits for W of 0, and w—2|@—g|t or 
hs (GZ) = n (w— 2 |G —g Pw"... eeceeeeeeeeeeerees (xl), 
a symmetrical curve made up of two portions of (x — 1)th order parabolae placed 


“back to back” on the ordinate through G=g. The form of distribution is shown 
roughly in Figure 8. 


* This expression for the special case of n=2 has been given by Borel: T'raité du Caleul des Prob- 
abilités, Tome I, Fasc. 1, p. 20, 1925. 


t Throughout this paper |q| denotes that the expression q within the rules is to be given a positive 
sign, 
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The chance of drawing a sample with centre at a distance as great or greater 
than | G —g| from the centre of the population will be 


g+hw : 
Po=2| n{w—2(G—g)}""/w" .dG = {w —2|G—g)}"/w” ...(xli). 
Finally we may find the distribution of z’=(@—g)/}W. Making the trans- 
formation 2’ = 2 (G —q)/W, we have Wd Wdz’ =dWdG, and from (xxxvii), 
$d. (2, W) = fn(n—1) WP Jewr® ....... 00... ccc cceee (xlii). 


For a given value of 2’, W can increase from 0 until {W+ |G@-—g| = 4w, or 
W=w/(1+{2'|). Integrating out for W between these limits, we have 








Gs (2) = 3(m — 1) {LH ] 2 [Pr 00... ccc ec ceees (xliii). 
\ i 
——— Fu oi Bi ——___—+ 














Fic. 8 


This distribution of 2 = (G@—g)/}W is of considerable interest. It corresponds 
to Student’s distribution 
ti 


J (2) = constant x (1 + 2°) 2 


for the ratio z= m/s for samples from a normal population. 2’ is the ratio of the 
distance of the centre of the sample from the population centre to half the sample 
range, that is to say, like z it is the ratio of a measure of location to a measure of 
scale. The distribution of 2’ is independent of the population range w just as that 
of z is independent of the population standard deviation o. It can be shown that 
z’ is constant over parts of n(n —1) primes passing through the centre of the 
fundamental hypercube and making an angle with its diagonal #,=a,=...=a 


, 


which depends only on the value of 2. 


n> 


These primes correspond to the conical 


z contours in the normal space. 


14 


9 
2 
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Suppose that we have a problem in which it is wished to know whether it is 
likely that a given sample = (W, G) has been drawn from a rectangular popu- 
lation with centre at g, but for which the range, w, is unspecified. We are now 
testing Hypothesis B. Following the argument employed in dealing with normal 
populations, it is found, as shown in Figure 9, that the ratio of (1) Maximum 

















(origin) 


' 2 (3 w+|G-g| ) ' Rangde of population of maximum 
q . likelikood with cerfre at g: 
G - 
oo : Range of sample ard of population 
Ww of maximum likelikood (erestricted) 
Fic. 9 


likelihood for a rectangular population with centre at g, to (2) Maximum likelihood 
for any rectangular population, is 


wns W +2 | G -9 |)" a {] + | Z o-*, 


This expression is constant along the contours of z’, and therefore leads to the 
probability integral of (xliii) as the criterion to employ in testing Hypothesis B, or 


P;=(n-1 )| OG +sy"*de = +s) "" «2.5 aeons (xliv). 


If the 2 distribution is applicable only to exactly rectangular populations, its 
interest is mainly theoretical, but if it car be shown that it will represent 
adequately the distribution of the ratio (deviation in centre)/(half range) for 
small samples from a wider range of symmetrical populations, it will be of greater 
value. The test in this form or in its inverse form discussed below would enable 
an estimate to be made of the probable limits within which lies the mean of a 
symmetrical limited range population, from a knowledge of the two extreme 
individuals of the sample only. This it would do in the same way as Student’s 
test does from a knowledge of the mean and standard deviation of the sample. 
Some further consideration of this point is being undertaken. 


Fisher has given the limiting forms of the distributions of range (xxxviii) and 
centre (xl) when x is large*. He did not consider the case of small samples, but 
used his results as an illustration of the way in which the method of maximum 
likelihood picks out the “statistic” G as the estimate of the population mean 
which has the minimum standard error. If the estimate is to be based in this 
way on two observations only, it is of course necessary that the grouping should 
be fine, and that there should be no possibility that either of the extreme obser- 
vations is subject to error of measurement or of record. 


* Phil, Trans. A, Vol. 222, p. 348. 
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(11) Solutions obtained by the Inverse Method. 

As a population is completely specified by the position of its centre g, and 
range w, the problem can be considered in the (g, w)-plane. This is represented in 
Figure 10. 
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B is the sample point (@, W), and P represents a population (g, w). For it to 

be possible that = has come from II, it is necessary that 

g—iw<G-4W<G+tw<g+h, 
or that —h(w-— W)<g-G<i(w- W), 
or that w>W+2\g-G\. 

Consequently the (g, w)-field is limited to the region inside the acute angle 
TBT’, where ZTBD= 2T’'BD= tan“ 4. The likelihood of a rectangular popu- 
lation represented by P is LZ « w~", and we may give to the field a point-density 
d=w. This is constant along lines such as CDC’ parallel to the axis of g. 
L itself provides a criterion by which to judge the probability of Hypothesis A, 
but we obtain a more readily comprehensible scale by taking the ratio of 
(1) I’, the integral of d throughout the portion of the field TCC’T"’, to (2) J, the 
integral over the whole field TBT’. This gives, after simple integration, 

Py, =TI'/I = {(n-—1) w—(n— 2) W} W"/w ............ (xlv), 
and provides a measure not only of the probability that the range in the 
sampled population is >w, but also ‘from the standpoint of likelihood it is the 
criterion to use as to the probability of Hypothesis A taken as a whole. 

Finally we may consider the probability that the centre of the sampled 
population is such that 


|g -—G\| >a, say. We shall obtain a measure of this by 
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integrating d over the two corresponding regions of the field, and dividing the 

result by J. It is found that 

Qiq-—G\)- , ie 

aa ; , where | g — G| =a ........000. (xlvi). 

Compare now (xlv) with (xxxix) and (xlvi) with (xliv), remembering that 

2'=2\|G@-—g\|/W. It will be seen that by changing n —1 into » in the solutions 

of the inverse method, P,, becomes identical with Py and P, with P,. Con- 

sequently for a given sample (G, W) and hypothetical population (g, w), these 





P,= {1 + 


pairs of criteria will give almost identical numerical values unless the sample is 
very small. The distributions of range and of the ratio 2, (xxxviil) and (xliii), 
were in fact first obtained by one of us from the integrals P, and P, of the 
inverse method, making the change from n—1 into » from analogy with the 
solutions in the normal field! 


(12) Illustrations from Experimental Sampling. 

In practical application the weakness of the methods discussed—and indeed of 
any method of dealing with small samples—lies in the fact that while questioning 
the position (as measured by m or g) and the scale (by o or w) of the sampled 
population, they provide no test of the shape of the distribution. They assume 
that the 8, and 8, of the population are known to be 0 and 3, or 0 and 1°8, as the 
case may be. In Section (8), analysing Church’s samples of 10 from a skew popu- 
lation, we have considered from one direction what is the risk of error involved in 
making these assumptions. We shall now consider this from another direction. 

Two sets of random samples from a rectangular population were obtained * : 

(a) 500 samples of 4. Tippett’s Random Sampling Numberst were used, the 
distribution of the population being divided into 21 groups. As a check on the 
random nature of the sampling, the distribution of the total 2000 observations 
was compared with the “expected” distribution, namely one containing 95°24 
observations in each of the 21 groups. Applying the (P, y*) test for goodness of fit, 
it was found that y?= 16°08 and P =°712. 

(b) 540 samples of 10. These were obtained by drawing 10 coloured beads 
from a box containing 50 beads of each of 11 different varieties. The beads were 
replaced and shuffled after each draw of 10. The population was not therefore 
strictly “infinite,” but as M/=55n the effect is hardly likely to have been of great 
importance. Unfortunately it was not realised until afterwards that the sampling 
was biased because the various types of beads were sufficiently different in size 
to make the chances of drawing beads of the different colours not always equal 
to 1/11. The distribution of frequencies among the 11 groups for 4200 beads 
was 409, 446, 362, 404, 355, 407, 342, 422, 363, 375, 315. The “expected” 
number in each group is 381°8. Applying the (P, x’) test, it is found that 

* (a) were drawn by Miss M. Page, (b) by Miss Page and two students of University College, London, 


Miss F. E. James and Mr T. Uchida. The work of computation has been mainly Miss Page’s. 
+ Published as No. XV of the Tracts for Computers, Cambridge University Press. 
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x’ = 39°71 and P = ‘00002. The samples must therefore be considered as drawn 
from some irregular and slightly skew population, and no very detailed analysis of 
the results seemed to be justified. 

Taking the experimental sampling (a), we may first test the formulae of the 


preceding section. 


The Distribution of Range, W. 





In sampling, an observation fell into one or other of the 21 equal groups 
into which the population rectangle was divided. An exact range for the sample 
was not therefore obtainable, but it seemed that an adequate test could be 
obtained by taking the population range as w= 21, and by supposing that samples 
whose highest and lowest observations fell into: 

(1) neighbouring groups, (2) groups 2 apart, (3) groups 3 apart, ... (20) groups 

20 apart, 
had ranges, W, falling into groups 
(1) 0-15, (2) 1:°5-2°5, (8) 2°5-3°5, ... (20) 19°5-21°0. 
The theoretical distribution of W obtained from (xxxviil) is 
me FEE a EE anise evinicsincenerecend (xlvii). 

Taking its integral over the corresponding grouping ranges for W, we have 
the set of observed and theoretical frequencies shown in Table IV. The (P, x’) 
test makes y? = 13°43, which for n’ = 17 gives P = "641. 

The Distribution of Centre, G. 

The origin for G was taken at the centre of the middle or 11th population 
group; the values of G were either integral or contained half units. 


A value of G,“0” was taken as lying in a group ranging from — 0°25 to + 0°25, 


« Py ad OF 7h 
” >» ‘0 v0 ” ” ” » +0 25 » +0 fo, 
‘ )” 75 O95 
»” ” ‘] 0 » ” » ” os 0 fo 5 + 1 25, 
GOR cesses 


The theoretical distribution of G obtained from (xl) is 
gad (1 — 2 |G /RB PSI ........0.cereceecesses (xlviii). 


Taking its integral for the required groups, we obtain the series. of observed 
and theoretical frequencies for G given in Table IV. These results lead to 


5 


x’ = 17°31, which for n’ = 21 gives P = °633. 


For the mean and standard deviation of G, we have 


Observation Theory Standard Error 
Mean — 0950 0 1212 
Standard deviation 26778 27111 ‘0857 * 


* Approximately, on the assumption that the distribution of G is a normal curve. 
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The standard deviation of the sampled population is 21/(2 V3) = 6-0622 ; con- 
sequently the standard deviation of the mean in samples of 4 is o/Vn = 30311, 
illustrating Fisher’s point referred to above that G provides a better estimate of 
the population centre or mean than X, the sample mean. 


The Distribution of 2’ =2(G— g)/W. 
Using (xliii), we see that the distribution of z’ is 
ROE EE IE acvnsksciemnoreenaccoomanl (xlix). 


Integrating this expression within the grouping limits shown, we obtain the 
observed and theoretical frequencies given in Table IV. Here y?=12°64 and 
n’ = 14, so that P = 477. 


The agreement of observation with theory for these three distributions is 
therefore very satisfactory. We may consider now the further problem. There 
exist_a great variety of limited range symmetrical (and unimodal) distributions 
with values of 8, lying between 3°0 and 1°8; it is very unlikely that, however 
inadequate our information, we should suppose that the population from which a 
sample has been drawn to be normal when in fact it was rectangular. But we 
can obtain some idea of the degree to which the normal criteria will fail for 
samples from populations with intermediate forms—say with 8, 2°0 to 26— 
by testing the adequacy of the normal theoretical distributions in this extreme 
case of 8, = 18. 


abl . . . . > . . . /é 
Take first the distribution of range. For a rectangular distribution w= 2 V3e ; 
it follows from (xxxviii) that the frequency constants of the sample range, W, are: 





ae - 
atta eno 

n+1 n+1 
= w 2(n—1) [6e pay |" 
Standard Deviation = ae xf ee = 5 oT ae r 
: 2(n+2)(n —8Y 6 (n+ 2)(n?— 2n4+ 38 
B, = ; B.= 


(n—1)(n+ 38) ’ ~ (n—1) (n4+38)(n+4) 


The ratios of mean range and of standard deviation of range to the standard 
deviation of the population, and the #, and f, of the range distribution for 
samples from a normal population, are known*. These values are compared with 
the true values in Table V, for n = 3, 4, 6,10 and 20. For the smaller samples 
there is a fair degree of correspondence in the mean range, but as » increases the 
ratio for the rectangular distribution tends to 2 V3 = 3°4641 while that for the 
normal distribution increases slowly but without limit. Even at n= 3, the standard 
deviation ratios have little correspondence, while the 8, and f, values show no 


agreement, except in giving increasing skewness as increases. 


* L. H. C. Tippett, Biometrika, Vol. xvut. pp. 364—387. The values given in the table were actually 
taken from the summary given in Biometrika, Vol. xvit. p. 192. 
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TABLE V. 


Comparison of Frequency Constants of Range. 



































Size of Sample 
Population a 
3 j 6 | 10 20) 
Mean Wie ie Rectangular l 732 2-078 2°474 2°834 3134 
\ Normal 1-693 2°059 2°534 3-078 3°735 
| 
Standard Deviation { | Rectangular ‘775 693 553 “386 217 
of Wiao L| Normal 888 "880 *848 “797 729 
8 f Rectangular 0:000 “082 “356 “773 1°265 
ig “* \ 1 Normal ‘417 273 ‘189 "156 “161 
3 { | Rectangular 2°143 2°357 2-880 3°648 4°569 
7 oa | Normal 3286 37188 3170 | 3°22 3°26 
| 


Consider next what would be the result of supposing the sampled population 
to have been normal, and applying Student’s z-test to measure the probability of 
Hypothesis B. Does the distribution of z= m/s for these samples follow approxi- 
mately the law /(z)=constant x (1+2*)-?? A comparison of observation and 
normal theory is given in the fourth section of Table IV. It is found that 
x” = 10°77, which for n’ = 10 gives P =*293. The main discrepancy, which is not 
serious, lies in some large negative values of the observed z’s. The observed 
standard deviation is o,= 1:3030 against the normal value of 1/Vn —3 = 1-0000. 
As the standard error of ¢, in the normal case is infinite, owing to the 4th moment 
of the z-distribution being infinite, it is not easy to estimate the significance of this 
difference. Judging only from the y?-test, however, Student’s distribution appears 
quite adequate to describe the variation in z, as far as the evidence from these 
500 samples goes. 

It must be remembered, however, that even if z calculated for samples from a 
rectangular distribution follows Student’s law, P, has changed its meaning as a 
criterion ; it no longer enables us to reject or accept Hypothesis B upon the same 
logical basis. As we have seen, a more suitable ratio to employ would now be 
2’ =2(G-—g)/W. But is z highly correlated with 2? If this were so, in judging 
from z we should reach results not very different from those based on z’. We 
have calculated the coefficient of correlation between z and 2’ for the 500 samples ; 
it has the very high value of +9756. These results therefore suggest that for 
samples of 4 from a rectangular population, (a) the distribution of z= m/s follows 
Student’s law with sufficient approximation for ordinary purposes, and (b) the 


criterion P, will be closely equivalent to P,. Consequently, if we assume 
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erroneously that the sampled population is normal when in fact it has a 8, and f, 


on or cl 
not be 


for 


ose to the stretch of the 8, axis lying between 8, = 3-0 and 1'8, we shall 
led into serious errors of judgment by using Student’s test. The results 


samples of 10 considered below suggest that for this case also point (a) holds 


good, and since as n increases the distribution of m/s tends to that of m/o or to a 


normal 


curve with standard deviation o/Nn whatever be the form of the popu- 


lation sampled, the most crucial test of correspondence (except for the cases 
i I 


n=2 and 3) would appear to have been made at n= 4. 
correlation between z and z 


On the other hand, the 
2’ which is perfect at n = 2 will decrease as n increases, 


and the mean and standard deviation of the sample are less and less influenced 


by the 


will the 


Central values 


values of she two extreme observations. The interpretation of the z-test 


refore become more and more doubtful. 
TABLE VI. 


Distribution of z in 540 samples of 10. 















































. | 
Less than | om = yr | ie’ : r | “AF ‘ “| ry 
of z ~ 1°80 ee es 1 H36| 1 —1 6 then bk ead i a, —“to 
| | 
iis ‘Gia eens ea = a ee i” eee 
Observation ae a a ee 1 1 | t+ beta 5 
Normal Theory 0-1 01 Ol 01 0-2 0-4 0°6 o-9 | 1°5 | 2°5 | 4°2 6°8 
| 
| | | 
| 65 55 | —-45 | —°385 | —-95 | --15 | -—-05 | +°05 | 4°15 | 4°25 | 4°35 | 4-45 
| | 
| ae, |S re ——— at a ee 
ie | | aaa | | 
Observation . | 11 13 AR) 29 44 53 71 55 86 47 29 18 
Normal Theory | 10°9 16°8 25°1 35°4 16°3 56°1 61°9 | 61°9 | 561 16°3 | 35:4 25°1 
| | | th 
= | i = a Greater than | m 4. 
+°55 | 4-05 | 4-75 | 485 | 4-00 p15 | 115 [ares far , + 1°40 Total 
= | Pes aes ee! =~ - | a = = 
‘idetesibins 19 | Rene. | 3 | 1 —s _ 540 
Normal Theory 16°8 | 10:9 | 68 | 4:2 | 2-5 15 | 09 | 06 0:4 0-6 540°0 
| | | | | | 
We 


of 10. 


shall now consider briefly the experimental sampling (6), with 540 samples 
As stated above, no very detailed analysis is justifiable. Since the population 


was only divided into 11 groups, it was not possible to obtain satisfactory distri- 


butions of 
calculated. 


as exac 


means, 


, 


W, 
Owing to the biased sampling, the population could not be considered 
tly rectangular. 


G@ or The mean, standard deviation and ratio z were, however, 


In order to correct to some extent the error involved, the 
m, were measured not from the centre of the middle or 6th population 


group, but from a point — ‘19 from this, corresponding more nearly to the mean as 


observe 


The 


ave shown in Table VI. 
P =:027. 


d from the samples. 


» observed distribution of z and the corresponding theoretical frequencies 
16 makes 
A comparison of the frequencies shows that this is due to a tremendous 


The fit is bad, giving y* = 27°16, which for n’ = 
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contribution of 15°94 to x? for the group z= +°l to +°2. But beyond this, the ! 
observed distribution of z is somewhat skew, having 8, = ‘395 and 8, = 4988. Now 

this cannot be due to the error involved in assuming that the z of samples from a 
rectangular population follows the distribution appropriate for a normal population, } 
for in both cases the distribution of z must be symmetrical. it is probable that the 
asymmetry as well as the great discrepancy in the group z=+-°1 to +°2 results 

from the irregularity of the bead sampling combined with some effect on the 
grouping of z introduced by the correction to m referred to above. 

Using the only other measure of comparison, the standard deviation of z, 
we find: 

from observations o,=°3707 ; from theory o,=°3780, 

with a standard error which would, if the distribution of o, were normal, be ‘0113. 
It appears therefore that as far as the scale goes, the observed z distribution is 
represented by the “normal” value of o,, or 1/Vn —3. Further sampling would be 
necessary to determine the form of the curve*, but as the probability integrals 
of symmetrical curves with the same standard deviation change very slowly with 
Bz, it seems very probable that for n = 10 Student’s distribution will still represent 
adequately the distribution of z from a rectangular population. The interpretation 
of P, in testing Hypothesis B is again of course doubtful. 

While the distributions of 2 appear in both these series. of samples to be 
represented approximately by Student’s curve, the normal P, contours are found 
to be quite inadequate. This is partly because the regression of s on m is no longer 
linear, and partly because the distribution of the s-margin cannot be represented 
even approximately by the normal s-distribution. 


The standard deviation of the variance, s*, is given by 


—] ; 2 
sonst) J 9,-%4 ais 
nz n—l 


where £, refers to the sampled population. Further o,=o,/20 fairly closely for 
large samples, though with increasing approximation as n decreases. It follows 


TABLE VIL. 




















, 
| Observations from Rectangular Population | Theory for Normal Population 
n Mean s/o| o,/o | B, | By | Sk. |Mean s/o} o,/o B, Bo Sk 
| 

+ —|—|— 

Y "822 275 | °006 | 2°379 | — ‘074 -798 337 | °236 , 3°108 | +°270 
| 10 932 "148 | ‘O70 | 2°761 | - ‘174 923 220 | -063 | 3011 | +°128 | 
| | 











that o, is very sensitive to changes in B,, and the drop from 3:0 to 18 in B, 
completely alters its value. Also, as the results in Table VII show, the observed 

* In view of the complex nature of the distribution of m as found by Irwin and Hall (Biometrika, 
Vol. x1x. pp. 225 and 240) it seems unlikely that the theoretical distribution of z would be easy to find. 
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distributions of s for the samples from the rectangular population are skew in the 
opposite sense to those from the normal population ; the skewness increases with n. 


In view of this complete lack of correspondence, except perhaps for the mean s, 
it is not surprising to find that the P, contours are here of little value. Placing 
down upon each (m, s)-distribution three contours, the results of Table VIII below 
were obtained. The observed frequencies are in all cases less than those expected 
on normal theory, so that in using the P, contours we should be inclined to under- 
estimate the significance of a divergence in the sample point. At 8,=1°8 we have 
reached a position quite out of touch with the normal field; it is hoped shortly to 
provide information on the position at an intermediate point, say B,= 2°5. 


TABLE VIII. 








| n=4 n=10 
| | 
| Observed Observed 
k frequency outside | P, x 500 k | frequency outside P, x 540 | 
| contour contour 
} 
SS eee eee Te eee See eee } 
| | | 
‘70 | 138 196°5 “50 | 179 271°9 

1:00 | 48 68°8 ‘70 19 33°38 | 
1°50 1] 12°1 ‘90 | 2 4-2 | 
| 








IV. SAMPLING FROM AN EXPONENTIAL POPULATION. 


(13) Sampling Distributions of Frequency Constants. 


The frequency distribution of the population may be written 


fase nantes | 
ae 9 (1), 


where b is the distance from the origin to the start of the curve, b+ the distance 
from the origin to the mean, and o the standard deviation of the curve. In the 
fundamental space of II, we take rectangular axes through the point (6, b, ... b), 
and writing «=X —b, the field in which ¥ may lie is that for which all the «’s 
are positive. The appropriate density with which to fill this field will be 

_ Sz) NE 


D=a-"e ° 


SS . ” mticce aa menneee ee eteeeee wee(li) 
where @ is the distance of the sample mean from the start of the population curve. 
The contours of equal density will be the m contours, which are primes in the 
n-fold space lying at right angles to the’diagonal axis 
CH wey Aenean ES en eRe (iii bis). 
We shall now find the contours in this space upon which the ratio 


» = Likelihood of IT/Likelihood of TI’ (max) 
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is constant. For an exponential population, II’, with start at b’ and standard 
deviation o’, we have 


n(X-Dd’) 
La oe ©. paettiileeaeitsaasneomiireanes (Iii). 
aI _n(X - b’) 
4 , . > . . . 
Whence —; oc no’—"—e ° , showing that for a given value of o’, L is a 


ab 
maximum when X —D’ is as small as possible. Clearly this will occur when b’=X,, 
the lowest of the sample variates, for it is impossible for = to have come from II’ 
if X,< U’. 


n (X —b’) 
OL ’ Pa . y } . . : 
Further -— « no'"~e *  {X —b-o’}. This expression vanishes when 
do 
a) o =0 making L = 0. 
§ 
(b) oc =a » L=0. 


(c) o = A-¥ iS Lo (X — o)*e*. 
The last is the maximum solution; it follows that for a given sample, the 
exponential population of maximum likelihood is that for which 
b= X,=b+42,; o’ =X —~X,=f£-x2,. 


Hence the ratio X is given by 


ee eco tii) 
o 


This result shows that the method of likelihood picks out % the mean, and a, 
the lowest observation, as the two descriptive measures of the sample; that is to 
say it suggests that we should take X,=b+a, and /=X —X,=%—a, as the 
sample estimates of the population parameters band o respectively (see Figure 12). 
In the fundamental space, is constant upon the contours 

(a, + l)/o — log, (i/o) =k =1—(1/n) log, Ar....... 2. cee ee ee (liv), 


and this will be the system to use in testing Hypothesis A according to the criterion 
of likelihood. The problem can be referred to the («,, /)-plane, and it is necessary to 
find the appropriate density, d, for this field. This can be obtained most readily by 
obtaining first the (#,, @)-field. 

The chance of drawing a sample from IT with the n observations in the limits 
a, + 4a, ... t,t }day is 

7 NX 
Cda,...da,=a%e * da, ...d&,. 
Take now as variables 
— ee eS ee 


Z, = a, the lowest observation of sample, 
Sg Hy, Bg = Oh, 000 Sq Bus 
Hence 
Zo 


Cda,da, ... da, = Cdz,dz,dz,...dz,=a-"e ° dz,dz,dz,...dzn. 

















‘VW Py 7 ~ 7 
J. NEYMAN AND E. S. PEARSON 223 
For a given set of values of 2), 2, 24, ... Zn, the variable z, cannot be less than 2, 
and takes its highest value when 2, = #, = 2,, and consequently 
24, + 3+ Ht... +2, =NB, OY 2, = % — 22,— 2%... —Za=h- 


Hence 


uy 
Cdz,dz,dz,... d2n=dz,dz,dz,...dzna-"e °| dz, 


=dz,...dz,0"e °(%—3z,—2,...—Z,): 
Now for given values of 2, 2, 2, --. Zn, the variable z, can vary from z, to 
Uy = 2) — 32, — 25... — Zn, corresponding to the case where both «, and a, have their 
lowest possible values of «,. 
Hence 
_ *o tls 
Cdz,dz,dz;... dz, = dz)... dZ,0~"e 7 (2 — 3z,— 2... — Zn) dz, 
4 
_% 
=dz,...dz,a-"e 1° $(2,—42,— 25... — Zn) 


Proceeding in this way we can integrate out successively for all the variables 

Zs, 2) ««» 2n, and shall be left with 
= “0 
Cdz,dz,=dzadz,0"e ° (2,—nz,)"*/(n —2)!. 

This process of integration will have covered only 1/n of the complete funda- 
mental space, namely that portion in which #,= 2, is the lowest observation. There 
will be n—1 similar regions in which ay, #, ... or a, is the lowest. To obtain the 
chance that the mean of the sample lies in the limits @ + }dz, and that the lowest 
value of n undifferentiated observations lies in the limits #, + }da,, we must mul- 
tiply the above expression by n and write z= n%, z,= #,, giving 

Nx 
d, (&, #,) dtda, = a" (n"/(n — 2)!)(@—a,)""e 1° d%da, ......... (Iv). 

From (lv), integrating for «, between the limits 0 and @%, we find the distri- 

bution of means, %, or 


Ge) = 0 * (nM —1) 1) Bt cccesesnccscsssncseed (Ivi), 
a ‘Type III curve corresponding to a special case of the result obtained by Church *, 
Next, upon integrating out for between the limits 2, and #, we obtain for 
the distribution of the lowest observation in the sample, 
i NX) 
DER RERRI ssenesiiertionnsniion ...(Ivii), 
which is another exponential curve, with standard deviation o/n. 


* Biometrika, Vol. xvi. p. 336, giving the distribution of means in samples of x from a Type III 
population. 
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Putting in (lv), €=/1+4,, we obtain for the frequency function of , and J, 
n(l+2,) 
d, (a, D=ao* (n*{(n—2)!) Pe nee e ee se eeees (viii). 

If then we know that the sample = has been drawn from some exponential 
population and we wish to judge the probability that it has been taken from II, 
the suitable criterion to take is P,, or the integral of a density d= ¢,(a,, 1) taken 
over the portion of the (a, /)-plane lying outside the member of the system (liv) 
passing through the sample point, (a, /). 

The (a, /)-plane is represented in Figure 11, OA, OB being the axes of J and a. 
G is the point #,=0,l=c, representing the sample for which II would be the 
population of maximum likelihood. By differentiating (liv) it is seen that 


d Zo =t 











dl . 
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and consequently all contours cut 4 line /=constant at the same angle. The 
differential coefficient is zero along the line /=o, and for very divergent contours 
tends to + 0 and —1 at the two peints of intersection with the axis of l. C repre- 
sents asample }; P, is the integral of the field taken outside the curve through C. 
We have not attempted to compute this. 


—s on 


which we have reason to believe has been drawn from some exponential population, 


Consider now Hypothesis B. Having a sample =, with observations X,, X, 


we wish to measure the probability that it has been drawn from a population 
whose “start” or lower limit is at a distance b from the origin. The parameter, o, 
of the population is unspecified. Then, as before, we consider the ratio of 

(1) the maximum likelihood for an exponential population starting at b, 
to (2) the maximum likelihood for any exponential population. 
The former corresponds to the population for which b'=), o’ =X —b, the latter 
for which b’ = X,, o’ = X — X,, as found from (li). 
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The position is shown diagrammatically in Figure 12, where the observations of 
the sample are shown as black circles, and the exponential ci.rves of the two hypo- 
thetical populations have been dotted in. The ratio of the two likelihoods becomes 


(X a b)-” e~"/( x at x,y" e7-? = (1 a *1\" ; 


ey 
This is constant along the radiating lines, such as OCT in Figure 11, upon 
: oes , &,-b. , x ; 
which the ratio 2” = 2,/l = 7 is fixed. We may therefore take these as the 


appropriate contours to use in testing Hypothesis B. It remains to find the distri- 
bution of 2”. Writing in (Iviii) #, = lz”, we have 

_al (1+2”) 
od; (2”, 1) dz” dl =a-" (n"/(n — 2)!) "ee eGR, Gest ostes (lix). 
\ \ 
, \ 
\ . 














\ » Sample Meare 
\ \ 
\ » 
N a 
‘ > 
‘ ~N : 
> ~ 
~ ioe 
r _ Ni festa Oe, 
Qt —+|+—x,—+eee e : Ld ae Dots representing 
LA RS: sample of 2 ( 6) 
. 





Fic. 12 
This must be integrated for values of 1 between 0 and 2. Writing 
t=nl(1+2’)/o, 
we find 


ds (2”) dz” =(1 + 2”) dz” [ t”“ et dt/(n — 2)!, 


or he (2) = (2 — 1) (LA 2) ™ «0... ccrceeeereesenceeseeees (Ix). 
It is easily shown that 
Mean 2” =1/(n — 2). 


Standard Deviation of 2” = V(n — 1)/(n — 2)? (nu — 8). 
Further we obtain 


Py= “(n —1) (1 + 2) da” = (1 + 2° )rF™ 00... ccecoees (1x1), 
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which may be used as a criterion for testing Hypothesis B. It is comparable with 
and very similar to P,, (xi), and P,, (xliv), corresponding to the normal and rect- 
angular cases. z, 2’ and 2” are all ratios of a deviation in the measure of position 
to the measure of scale obtained from the sample. 

These results which have been expressed in terms of #, and / are again what 
may be termed the “ideal” results for random sampling. Unfortunately in practice 
two difficulties often arise : 

(a) There may be slight errors in observation, or variation in the character 
measured in an individual, which are not due to chance fluctuations. 

(6) The unit of measurement or the grouping unit with which data have been 
tabled may be too coarse for the value of a, to be determined with adequate 
precision. 

Under such conditions a method which provides a criterion depending upon 
the measure of a single variate, such as #, will be at a disadvantage compared 
with one which depends upon an average value, such as the mean or the standard 
deviation. In the latter case the effect of a few badly measured observations, a 
few disturbed variations or of coarse grouping is likely to cancel out. There are, 
however, we believe, problems to which the methods of the present section can be 
applied, and one of these in connection with random intervals we shall take as an 
illustration in the section that follows. 

It has not seemed necessary to develop the inverse solution considered in 
dealing with the normal and rectangular populations, but the working would be 
quite straightforward. 

(14) Test for Random Intervals. 

Whitworth, in Choice and Chance*, has shown that if the intervals of time 
between events which happen once on the average in time ¢ may be considered 
as random, then the chance after an event has happened that another will not 

t 
occur in an interval ¢ is given by e If y=/f(t) represents the frequency distri- 
bution of the random intervals between the events, then 
t t 


> a " 1 i 
e =| F(t) dt, or f(j=-e -. 
Jt e 


Morant*+ has extended this idea to the case where there is necessarily a closed 
interval of length 6 after an event before another can occur. 


The frequency distri- 
bution of intervals will now be 


Y = —C sev ncccccrcreccceceseccesceceeece (xii). 


( 
Suppose that we have v=n+1 consecutive observations of the occurrence of 
an event, recorded at times ¢,, ts, ts, .. 


. t), and that the nm intervals or differences 
between the ¢’s are 7, 2 


», +++%n- Let 2, be the shortest of these intervals. Let us 


* See p. 200, Prop. ut. 


+ Biometrika, Vol. xu. pp. 309—337. 
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imagine in the first place that we have reason to believe that the intervals are 
random, but that there is between each a closed period of length b. We wish to 
test from the observations whether this hypothesis as to b is reasonable, but are 
not interested in the average time c. The problem is exactly that of testing 
Hypothesis B. With the previous notation 





x, =%,—), 
}L =( ++... +%)/n—- ts = (ty — tb) /n—-%,, 
ls” =z,/l, 


\p.=(1 +g’) et, 


In following this method it is important to be clear exactly what interpretation 
may be placed on the value found for P,,. A large value (say 1 > Py > 0°5) will 
not imply that the intervals (less 6) are random. This result will follow whatever 
be the form of distribution of intervals, provided only that ¢,— is small compared 
to the average interval. A large P, merely suggests that if the intervals (less 
the closed period) are random, then it is reasonable to suppose that the closed 
period is b. On the other hand, a very small value of P,, suggests that either ‘the 
supposed closed period b is incorrect, or that the intervals (less 6) are not random. 
In cases where it is clear & priort that there can be no closed period, then a small 
P» certainly suggests that the intervals are not random. 


If however it is possible to observe a number, say N, of series of x consecutive 
intervals, we may calculate 2” in each case and compare the distribution of these 
N values of 2” with the theoretical frequency law (Ix). If there is agreement, then 
not only is it probable that the hypothesis as to b is correct, but also that the 
intervals (less b) are random. As 2” is independent of c, the average “rate of 
happenings ” of the events, it follows that if b is unchanged—and in particular is 
zero—we may combine together in the test, the 2’’s of different series of n ob- 
servations, even though this rate may change from series to series. The test is 
also applicable, however small x may be. The practical importance of this will be 
illustrated below. 


Through the kindness of Miss E. M. Newbold and Professor Greenwood, we 
have been able to make use of some of the original data regarding Factory 
Accidents used by the former in Report No. 34 of the Industrial Fatigue Research 
Board*. We may ask what answer the 2”-test gives to the following question : 
Are the lengths of interval between consecutive accidents incurred by the same 
individual worker distributed randomly? For this purpose use has been made 
of the cards of male workers in a Chocolate Factory during the year 1923+. The 


time of accident was recorded to the day and even hour, and by making allowance 


x 


‘A Contribution to the Study of the Human Factor in the Causation of Accidents,’ 1925. For a 
later paper by the same writer, see Journal of Roy. Stat. Soc. 1927, p. 487. 

t These are the cards of the series M1 of page 11 of the I.F.R.B. Report, No. 34. The injuries were 
mainly of minor character, i.e. ‘‘cut finger,” ‘‘ abrasion,” ‘‘splinter,” etc. attended to in the Ambulance 
Room of the Factory, and did not lead to absence from work. 


15—2 
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for Sundays, Bank Holidays, and as far as possible extra days of absence, sickness, 
ete.*, it was possible to obtain fairly accurately the number of days which each 
worker was exposed to risk between accidents. As Miss Newbold has shown, the 
liability to accident varies considerably among the workers, varying from none to 
fifteen accidents in the year for the cases examined, but the ratios 2”, which are 
independent of this liability or rate, can be combined together. 


The cards have been analysed for 50 workers; of these, 25 incurred only three 
accidents in the year, giving two intervals or the minimum to which the test can 


be applied. 


TABLE IX. 


Intervals between Accidents. Distributions of 2”. 





N=2 
2” | Observation 

j— 5 20 
5 1°0 7 
1°0— 1°5 t 
1°5 D*() 2 
2-*)0— 25 2 

2-5— 3:0 2°5 
SO— OO | 5 

40 50 1°5 

b°0— 6°0 _— 
CO— 70 | l 

i 0— 80 | — 

80— 90 | 

| 9:0—10°0 — 
| Greaterthan) 5 
10°0 (| 7 
Total 50 


Theory 


16°7 
8°3 
5°0 


et = DO = bo 
LS iw) 


tm ~16 


Difference in terms of standard error 


| 
| Mean 2” | 
| 
| 


nu=4 


Observation 


0 “3 4 
I— *2 8 
2— °3 4 
| 8— l 
| 7) l 
| *5—1°0 3 
| 1-0 1°5 3 
| 1°5—2°0 l 
| 2-0 50 — 
| 5°0—S°'0 — 
| ae | 
| 
| ‘Total 25 
“460 


| 


Theory 





n=6 
ss | Observation 

o—] 4 
J—-4 10 
9-3 5 

“2 “J Ped 

| “J 5 ae 
| Greater ) 1 

than *5 § 

Total 20 

179 


a 


} 
| 
Theory | 


— et DO te = 


In the above table are given the calculated values of 2” for (a) the 50 cases, 


taking the first three accidents or 


=n 9 


n= 2, 


(b) the 25 of these for which there were 


at least five accidents (n = 4), and (c) the 20 of these for which there were at least 
seven accidents (n=6). The intervals chosen were the first two, first four and first 
six of the year. It is clear that as the records were restricted to a year, they do 
not contain a completely random selection of intervals, at any rate for the workers 


* As the dates of absence were not specified exactly, but grouped in thirteen 4-week periods, there 
was often some uncertainty as to the accident interval to which they belonged. But it was always 
possible to locate the longer periods of absence such as the 10—14 days annual holiday. 








20ry | 


50 
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of small liability. The table compares the observed frequencies with the theoretical 
obtained by integrating (1x) between appropriate limits; also the mean 2”, and the 
ratio of the difference between mean (observed) and mean (theory) to the standard 
error. In the case of n= 2, the theoretical mean and standard deviation of 2” are 
infinite so that the latter comparison cannot be made, but as far as can be judged 
from a sample of 50, the correspondence in the frequencies appears very close. 
For n=4 and n=6 the suggestion of too few observed values of 2” in the first 
group would need confirmation from fuller data*; the mean values of 2” are 


TABLE X. 


Distribution of 2” ; Omnibuses and Pedestrians. 











| 





























| Intervals between Omnibuses | Intervals between Pedestrians 
| 
| 
n=2 n=4 n=5 
| Ot See ee, eee ee. aes a re eee 
| ; 
2” Observation Theory | * Observation | Theory i | Observation | Theory 
| | 
| = _ —= —————SSa=_EE = _— 
| : 
0— ) { 9°3 O— 1 3 7°0 0—1 4 3°2 
5— 1°0 7% 4:7 1 2 1 1°8 ‘I—2 3 2°0 
1:0— 1°5 15 28 | 2 +3 2 3:5 2—3 _ 1°3 
| 1:5— 2-0 3 1-9 "S— 4 5 2°5 If 0°9 
2*Y— 25 2 13 “4 5 2 19 j—"5 l 0°6 
O05 70 — 1°0 5—1°0 6 4°8 I—'6 1 O-4 
| F0— 4°0 2 1°4 1°0—1°5 3 1°7 6—-"7 1 0°3 
| 40 a) 1 0-9 1°5—2°0 2 0°8 Greater } ie 1°3 
| 5:0— 6-0 2 0°7 -)— 50 2 0-9 than °7 § 
| 6<O yO an 5 5:0—S8'0 l Or1 — . = 
| 7:0— 8:0 Or4 Greater ) 1 : a a —_ 
| 8&0— 9:0 — 0:3 than 8-04 
| 9°0—10°0 0:3 — — — as 
| Greater than) . 95, ae coe = — 
10°0 { = " 
a = = _— oe 
Total 28 28-0 Total 28 28-0 Total 10 10°0 
| 
Mean 2” | _ — 1°258 *500 “210 *333 
Difference in terms of standard error +46 — “82 








however quite in agreement with theory as far as can be judged with such large 
standard errors. 

To obtain some further idea of the sensitiveness of the test, we have applied it 
to the time intervals observed between the passing of consecutive omnibuses on 
a London bus route. These intervals are not of course random, the omnibuses 

* In general there could be no closed interval, b, the times between accidents varying from 0 to over 


180 days, but it is possible that after an accident the worker is for a day or two more cautious, and 
this reduces the chance of very small intervals and therefore small values of 2”. 
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having started at fixed intervals (apparently of 3 minutes) from three-quarters of 
an hour to an hour before the observations were made. During this time traffic 
delays, etc. would have superposed chance positive and negative variations on to 
the scheduled intervals, so that in some cases these had been reduced to a few 
seconds and in others had been doubled. Table X gives the values of 2” calculated 
for 28 sets of n=2 and 28 of n=4. The fit for n = 2 is far less satisfactory than 
in the case of the accident intervals, and appears even worse for n= 4, where the 
mean 2” exceeds the theoretical value by 4°6 times its standard error. Finally in 
the same table are given 10 values of 2” calculated from sets of 5 intervals of 
time observed between pedestrians passing a fixed point on a roadway; here, as 
far as can be judged, agreement is satisfactory, suggesting that the intervals were 
random. 


There has not been time before going to press to extend these results, but 
inadequate as the numbers are, they certainly suggest that the method may be of 
value in testing the randomness of intervals in many problems, and further that 
the test will be sensitive even for pairs of consecutive intervals (n = 2) provided 
that the number of such pairs available is large enough. 


V. CONCLUSION. 


The main problem that we have considered is that of determining whether it 
is probable that a given sample, taken as a whole, has been drawn from a specified 
population. From the idea of the representation of a sample of n as a point in 
multiple space, there follows at once the conception that the discriminating criteria 
may be regarded as associated with surfaces in this space which divide regions 
containing points for which the hypothesis to be tested as to the sample’s origin 
is less probable from those for which it is more probable. Regarded from this 
point of view it is seen that in any given problem there need be no single system 
of surfaces which it is “best” to make use of. The system adopted will provide 
a numerical measure, and this must be coordinated in the mind of the statistician 
with a clear understanding of the process of reasoning on which the test is based. 
We have endeavoured to connect in a logical sequence several of the most simple 
tests, and in so doing have found it essential to make use of what R. A. Fisher has 
termed “the principle of likelihood.” The process of reasoning, however, is neces- 
sarily an individual matter, and we do not claim that the method which has been 
most helpful to ourselves will be of greatest assistance to others. It would seem to 


be a case where each individual must reason out for himself his own philosophy. 


The differences in method of approach have been illustrated in the two problems 
that have been described as the testing of Hypothesis A and Hypothesis B. Both 
problems suppose a knowledge of the form or shape of the population sampled, 
A supposes its standard deviation known and B does not. In beth cases there 
appear to be four or five methods open to the statistician; if properly interpreted 
we should not describe one as more accurate than another, but according to the 
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problem in hand should recommend this one or that as providing information 
which is more relevant to the purpose. The principles involved have been illus- 
trated in the case of samples from normal, rectangular and exponential populations, 
for each of which analogous tests exist. Most attention has been given to the 
normal case, simply because the f’s of the populations of experience do tend to 
cluster round the Gaussian point. New tables have been computed for what is 
termed the P, test of Hypothesis A, and illustrations of some of the various types 
of problems in which it could be employed have been given. 


Now these tests, whether applicable to the normal, the rectangular or the 
exponential population, have all assumed both that there is perfect random sampling 
and that while there is uncertainty as to position and scale there is none as to the 
shape of the population curve. In following what would appear a more efficient 
system of reasoning, it must be remembered that te ideal situation is not quite 
that of practical reality, and we must be careful that what is gained on the swings 
is not being lost on the roundabouts. It would be useless to emphasise the gain 
in relevance of a P,=°08 obtained from Student’s Tables over a P ='15 obtained 
from Sheppard’s Tables, if in fact the former were in error by an amount of the 
order of 05 owing to a faulty assumption that the population sampled was normal. 
Owing to the rapidity with which the distribution of means tends to normality, as 
n increases, the P obtained by entering Sheppard’s Tables with z=m/s has a 
definite meaning if properly interpreted, even if the population sampled be skew. 
But by using P, we may be obtaining a false impression of the accuracy of the 
criterion upon which our judgment is to be based. In practice in dealing with 
small samples it is generally not possible to be certain of the form of the popu- 
lation distribution from internal evidence. But from other considerations, from 
previous experience in similar problems and so forth, we may have good grounds 
for believing that the distribution is roughly of a certain type, and if skew for 
knowing the direction in which it will be more sharply limited. There will 
undoubtedly be regions in the 8,, 8. population field surrounding the normal, the 
rectangular, and the exponential points and any other points for which the problem 
can be completely solved, within which the criteria appropriate for these spots will 
be adequate. It is therefore a problem of first importance to ascertain what is the 
extent of these regions of adequacy, and failing a fresh advance in theory* the 
most straightforward method of doing this is to take experimental soundings in 
the 8,, 8, field. In this process we have not at present gone very far, but have 
shown that 


(a) At 8, =0°2, 8, = 3:2, the normal theory for samples of 10 does provide criteria 
which for practical purposes are probably adequate in the case of the distribution 
of m, of s, of z and for the P, contours. 


It is of course a serious limitation to the practical value of the method of likelihood, that it appears 
only to provide readily soluble equations in certain simple cases. It is not until the samples are large 
enough for the application of the (P, x2) group test that the problem can be solved in a more general 
manner. 
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(b) At B,=0, 8,=1°8, the z-distribution for samples of 4 and of 10 appears 
adequate, but the normal theory quite fails for s and P,. In this direction an 
intermediate sounding is required. 


(c) Some experimental work at present in progress suggests that the normal 
theory will also be adequate for s and z for samples of 5 and 10 from a symmetrical 
leptokurtic curve with 8, as great as 42, 

(d) The results obtained in connection with the exponential population are 
quite out of touch with the symmetrical cases, but the distributions reached in 
that case will probably be adequate for a fairly wide range of skew populations 
with 8, and 9, in the region surrounding the point (4, 9). 


The limitation implied by the assumption of perfect random sampling must 
not of course be overlooked; this is of particular importance in the case of very 
small samples. It has been shown that there may be some latitude as to the exact 
shape of the population curve associated with each hypothesis, but it has been 
assumed that in any event the sample is a random one. It is true that the causes 
underlying the bias or selection which occurs in the drawing of a sample from II 
may be such that the sample can be regarded as randomly drawn from II’; but if 
a biased sample from an approximately normal population can only be considered 
as a random sample from a J-shaped curve, it is clear that our methods of com- 
parison will be inadequate. The bias may consist only of a few errors of observation 
or of crudeness of grouping, but the smaller the sample the more important will 
be the effect. 


The answer to criticism from this source is that the tests should only be 
regarded as tools which must be used with discretion and understanding, and not 
as instruments which in themselves give the’final verdict. It is impossible to 
escape from the difficulties ; they make it desirable to avoid the use of very small 
samples whenever possible, but do not prevent some conclusions of value being 
drawn when fuller data are not available. If the criteria that have been discussed 
indicate that the supposed population of origin is improbable in comparison with 
some other alternative, that is a sign which is not to be disregarded. But we 
must not discard the original hypothesis until we have examined the alternative 
suggested, and have satisfied ourselves that it does involve a change in the real 
underlying factors in which we are interested; until we have assured ourselves 
that the alternative hypothesis is not error in observation, error in record, variation 
due to some outside factor that it was believed had been controlled, or to any one 
of many causes which when they chance to modify even one observation in a 
sample of five may be of serious consequence. 

We must not conclude without thanking Miss Ida McLearn for her diagrams, 
Miss M. Page, without whose assistance in computing the completion of the paper 
in its present form must have been indefinitely delayed, and Mrs L. J. Comrie who 
carried out the final stage in the tabling of P,. 
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VI. APPENDIX: TABLES OF P),. 


P, is given by the relation 





n 
5 n ro 12 
> mn” aaa ee ‘ 7. 
Py= rs Se G@MdS .....: (xxi bis), 
es nfs! 
= Var 2 I ( 9 


where the integral is to be taken outside the curve in the (M, S)-plane upon which 
X is constant, and 
_" (M?+S2-1) 


A=Se ? eR ee I (xix bis). 
We may write 


ey 
P10 Sao € vs» 


logy) X = . {log S?— (M2 + S- 1) logy, e! _ ; {lo : 


and obtain for the equations of the contours 
(BM? +. S*) logis ¢ — logy S? = K........cccceceveness (xx bis). 

Values of M for equidistant values of S* were first computed for the curves 
corresponding to a number of values of k between *45 and 2°40. Some of the 
contours have been drawn in Figures 13 and 14. The lowest possible value of k 
corresponding to the centre of the system, M=0, S=1, X=1, is 

b = loge = *434,2945. 

If S, and S, be the points at which a contour cuts the axis of S, if M, indi- 

cates the value of M at the point on the curve corresponding to a given S, and 


if we write «,=Vn M,, then the integral (xxi bis) can be put into the form 





S, n—1 
F nN Oo 
1 . n . oo ae ? ga 
4(1—P,)= _ S"-2¢ (RY (1xiii), 
5 n—d n— +) 7 : 
“yee —— 
.2? ra, 
fas 1 —ly? 
where A (1+ a,)= | - e da, 
J-oN2r 


and is obtained by entering Sheppard’s Tables with 2,=VnM,. The values of 
the integral (Ixiii) were then calculated by quadrature. The task of calculating 
P, for an adequate framework of values of & and for n=3 to n=50 would have 
been very great, but it was discovered that the ratios P,/X for a given k are very 
nearly independent of n. Table XI gives the values of P, actually computed 
divided by the corresponding . It was estimated that the computed value of P, 
might be in error by several units in the fifth decimal place, and this limits the 
number of significant figures that can be given for the ratio as k increases. It 
also means that the final figures in the tabled ratios are not exact. The results 
however appeared to justify the following procedure : 

(a) To obtain interpolated series of ratios for n= 3, 4 and 10 from the ten 
or eleven values computed in each case. 


* The argument interval for S varied from -005 to -04. 
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TABLE XL. 




















| | 
k n=3 n=h n=5 n=10 n= 20 n=50 
——_| —__—__| Si, ae 
loge 1°0000 1°0000 1°0000 1°0000 1°0000 1°0000 
“45 — | 1:0169 -- 1-0170 - 1:0170 
50 1:0743 | = 1-0736 1°:073 1-0732 1-072 
55 : | a 1°1327 . | 1185 
60 1:1991 | 1°1977 — 171959 1°195- a= 
65 a — 1:2622 — —— 
70 | - | 1°3363 — 1:332 | 1°33 ‘ 
15 | 14117 — —- | — — 
"80 | 1°4921 14899 1°4851 —_ _ 
‘90 | 1°6672 - 1°66 - — 
1:00 | 1*864 = 1°85 | — 
| 1°20 2°335 2°330 = _— 
| 1°50 3-286 ah a 
| 1-80 4-62 os — | 
| 40 | bie ss 
YOU — | _— | 





(b) To extrapolate for high values of & for n=10 and 4, from n=4 and 3 
respectively, in view of the fact that as P, decreases only a few significant figures 
are needed in the ratio. 

(c) With the help of these “frames” and the isolated values for n = 5, to inter- 
polate appropriate ratios for n= 5, 6,7, 8 and 9. 

(d) Since the ratios at n =10 give accurately the three values of P, computed 
for n=20 and the three for n= 50, to assume that the ratios for n= 10 will be 
adequate for all values of n in the range 10 to 50. 


mm . . v - 
lo obtain P, it was only necessary to compute log,» =5 (logy e —k:), add to it 


g g 
the logarithm of the ratio P,/X and find the anti-logarithm. The P, tables given 
below were calculated in this way. They are given to four decimal places, but the 
final figure may often be in error by a single unit, although not, it is believed, by 
as much as two units. After the work was commenced it was realised that the 
variable /; was not perhaps the best to have taken from the point of view of inter- 
polation in the tables; kh? would probably have been better. It has been necessary 
to change the interval from ‘01 to ‘05 at k=°65, and from ‘05 to ‘30 at k=1°50. 
For high values of n and low values of i: the tables are not easy to use if four- 
figure accuracy is required, but for most practical purposes it is sufficient to obtain 
P, to two decimal places only, and in this there is no difficulty. 

Method of entry. 

Having obtained from the data of the problem under consideration M = m/o 
and S=s/o, k may either be calculated from (xx bis), or generally obtained with 
sufficient accuracy from one or other of the Figures 13 and 14. P, is then found 
by entering the tables with n and k. Examples of the use of the tables have been 
given in Section (9) above. 
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Extension of the Tables. 
On the assumption that the ratios P,/X for n=10 will continue to hold good 
for any value of n greater than 50, we may proceed as follows: 
(a) Find the appropriate &, (b) calculate X from the relation 
n 
logy X =5 (log, e ~ k), 
(c) multiply the ratio P,/X given in Table XII below by X, and so obtain P,. 
TABLE XII. 
Values of P,/X for n = 10 and beyond. 























! 
k Py jk Py} k > |r k P,)r 
“435 1-0008 “500 1-0731 565 11514 630 
440 1-0062 *5O5 1-O0789 “STO 1°1576 635 
44S 1-0116 “510 10847 575 1°1639 640 
“450 10170 “515 1-0906 “580 1°1702 645 
“455 1°0224 “520 10965 585 1°1766 650 
“460 10279 "525 11024 “590 1°1830 v0 
“465 1-0334 ‘530 1-1084 595 11894 “Td 
| “470 1-0390 “535 1-1144 *600 1-1959 80 
| "475 1-0446 “540 1°1205 605 1°2024 85 
*480 1°0502 "54D 1°1266 “610 12089 “90 
“485 1°0559 550 1°1328 “615 172155 "95 | 
‘490 10616 55 1°1390 620 1°2221 1-00 
95 1°0673 “560 171452 "625 1°2287 
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N.B. I feel it necessary to make a brief comment on the authorship of this 
yaper. Its origin was a matter of close co-operation, both personal and by letter, 
5 I ) I 


and the ground covered included the general ideas and the illustration of these 
by sampling from a normal population. A part of the results reached in common 
are included in Chapters I, II and V. Later-I was much occupied with other 
work, and therefore unable to co-operate. The experimental work, the calculation 
of tables and the developments of the theory of Chapters II and IV are due solely 
to Dr Egon S. Pearson. 


J. NEYMAN. 




















